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"■^Let  K  b«  a  dosed  convex  set 


L  -  {P  -  <PQ . ?m)  t  Px 

the  simple  problem: 


p2  - ...  -  p. 


■  f  (*■•+ 1 > 
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0)  .1  Then  for 


Mlnlmlee 
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^  A 


Subject  to  \P_-  (Pq.Pj^,  PB)  e  KOL  , 

C.A'  <uu-  /  ..x£r  '  '  '  3  Z'"  xa'v-  /v  f  y^'lcri  K  (C.  L— 

we  prove  e  duality  theorem  end  the  convergence  of  e  solution 

elgorlthm  modeled  on  the  duality  theorem  and  the  simplex 
method  of  linear  programing  respectively. 

Specialization  of  this  general  model  to  linear  pro¬ 
graming,  convex  programing,  generalised  programing, 
control  theory,  end  the  decomposition  approach  to  mathemat¬ 
ical  programing  yield  the  appropriate  duality  theorems  and 
solution  algorithms  in  each  case.  (  ) 

The  principle  idea  exploited  here  is  the  notion  of 
supporting  hyperplanes  to  convex  sets.  Tbs  duality  theorem 
ie  a  direct  application  of  the  fact  that  every  boundary 
point  of  a  convex  set  belongs  to  s  supporting  hyperplane; 
moreover,  the  generalised  simplex  method  presented  here  is 
most  useful  when  K  is  characterised  in  such  a  way  that, 
given  a  hyperplane,  the  translate  of  it  which  Is  s  supporting 
hyper  pi  ana  of  K  may  easily  be  found.  if  it  exists,  as  well 
as  the  points  of  the  supporting  hype rp lane  which  are  comon 
to  the  boundary  of  K  .  This  corresponds  to  "pricing  out" 
in  linear  programming. 


I  1 


We  show  that  many  problems  in  control  theory  are 
special  cases  of  our  model  and  for  a  large  class  of  linear 
control  problems  a  solution  method  is  outlined  to  illustrate 
the  use  of  mathematical  programming  techniques  to  solve 
optimal  control  problems. 

Two  appendices  contain  the  elements  of  the  theory  of 
affine  spaces,  of  convex  sets,  and  of  ordinary  differential 
equations  used  in  the  text. 
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MATHEMATICAL  PROGRAMMING 
AND  OPTIMAL  CONTROL  THEORY 

by 

Richard  M.  Van  Slyke 


Introduction 


Since  the  end  of  World  War  II,  many  theories  for  the 
"optimlzatlorf  of  mathematical  systems  have  been  developed, 
among  them:  mathematical  programming,  game  theory,  decision 
theory,  optimal  control  theory,  and  dynamic  programming. 
Concurrently  an  increased  Interest  In  the  more  classical 
calculus  of  variation  has  taken  place.  Since  all  these 
methods  are  concerned  with  optimization  problems.  It  Is  natural 
to  seek  a  unified  theory  of  mathematical  optimization.  Any 
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consolidation  and  abstraction  of  mathematical  theories  Is 
based  on  a  few  unifying  ideas.  This  report  Is  based  on  two 
ideas  which  I  heartily  endorse  as  candidates  for  use  in  the 
foundation  of  a  general  theory.  The  first  is  the  supporting 
hyper-plane  theorem  for  convex  sets.  In  disguise,  this  appears, 
for  example,  In  the  duality  theory  of  mathematical  program¬ 
ming,  in  the  maximum  principle  in  control  theory,  and  as 
one  player's  strategy  in  the  theory  of  matrix  games.  The 
second  idea  arises  from  the  emphasis  on  methods  of  solution 
and  constructive  proofs  more  or  less  inherent  in  the  very 
nature  of  optimization  problems.  This  idea  is  the  simplex 
method  of  Q.  B.  Dantzig  which  so  far  is  the  most  successful 
solution  method  for  a  wide  class  of  optimization  problems. 

Just  why  it  is  as  efficient  as  it  is,  is  still  a  tantalizing 

* 

question,  but  to  date,  the  simplex  method  is  the  best  method 
for  solving  linear  programs,  and  matrix  games  and  moreover, 


* 

Probably  the  only  unsolved  question  of  theoretical  Interest 
in  linear  programming.  Essentially,  if  a  linear  program 
has  n  columns  and  m  rows,  the  solution  consists  of  search¬ 
ing  through  various  bases  to  find  one  which  yields  the  opti¬ 
mal  solution.  Using  the  upper  bound  on  the  number  of  bases 
obtained  by  assuming  that  every  collection  of  m  columns  may 
form  a  basis,  the  number  of  possible  bases  might  be  expected 

to  be  of  the  form  (m)  the  number  of  ways  n  objects  can  be 
chosen  m  at  a  time.  Experience  has  shown  that  the  number 
of  iterations  is  of  the  smaller  order  of  m  to  3m,  and  no 
convincing  explanation  of  this  curious  fact  has  been  found. 

In  1937;  W.  M  Hlrsch  [LPE,  p.l60]  conjectured  that  it  is 
possible  to  progress  from  one  feasible  basic  solution  to 
any  other  feasible  basic  solution  in  m  steps  each  inter¬ 
mediate  basic  solution  also  being  feasible.  To  date,  the 
conjecture  has  been  verified  for  ra  <  5  with  the  feasibility 
set  bounded.  If  the  feasibility  set" is  allowed  to  be 
unbounded  Klee  and  Jalkup  llOal  have  demonstrated  that  the 
conjecture  is  false  for  m  _>  4  . 


forms  an  Integral  part  of  many  non-linear  programming  schemes. 
Its  use  In  the  theory  of  optimal  control  has  been  unnecessarily 
limited,  but  part  of  the  purpose  of  this  report  Is  to  Illustrate 
the  application  of  the  simplex  method  to  optimal  control 
problems. 

The  basis  of  this  report  is  the  following  very  simple 
problem:  Suppose  y  is  a  closed  convex  subset  of  euclidean 
m+1  space,  Em+1,  and  f.  is  the  line  £  -  ((PQ,  ...,Pm)  / 

P1  “  P2  “  “*  "  Pm  “  we  wl8h  t0  mlnljniz®  P0  over  the 

set  of  all  points  p  -  (PQ, . .  ,,Pm)  e  £n  y.  Clearly  the  optimiz- 
* 

lng  point  P  is  a  boundary  point  of  y  and  is  unique  If  It 
exists.  By  a  well  known  theorem  in  the  theory  of  convex 

sets  there  exists  a  "support lng ’’  hyperplane,  y,  in  En+1 

* 

with  the  properties  that  P  belongs  to  the  hyperplane  and 
y  is  contained  entirely  in  one  of  the  closed  half  spaces 
determined  by  the  hyperplane,  y  can  be  expressed  in  terms 
of  its  normal,  ft  -  (ftQ,  ...,ftm ),  as 

y  -  (P  /  ftp  +  ft-  0}  for  some  scalar  v  .  We  remark  In  passing 
that  if  y  is  determined  by  a  differentiable  surface,  then 
ft  is  proportional  to  the  gradient  of  the  surface  at  P#. 

Another  way  of  expressing  the  above  problem  is 
(1)  Maximize  Z 

subject  to  UqZ  +  P  ■  0 

where  UQ  -  (1,0,  . . .,  0)T  . 
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The  fundamental  problem  (1)  can  be  reformulated  in 
an  equivalent  form  which  will  be  of  great  utility  for  ua. 
This  formulation  is: 


(2) 


Maximize  Z 

subject  to  UqZ  +  y  =  0 


Z\i  * 1 

(Xj  *  0) 

where  again  each  may  be  chosen  freely  from  X*  The 
equivalence  follows  from  the  convexity  of  X. 

We  then  take  the  formal  dual  of  the  above  problem  in 
the  sense  of  linear  programming,  obtaining: 

Minimize  y 

(3)  subject  to  irP  +  y  ^  0  for  all  PcX* 


If  *  0,  for  some  supporting  hyperplane  of  X  at  P 


it  is  easy  to  3ee  that  w  *  5^-  lr  and  y 

^0 


1  <v 

y  is  an  optimal 


solution  of  the  dual  problem,  and  of  course  in  the  other 


direction,  solutions  of  (3)  determine  supporting  hyper- 

* 


planes  of  X  at  P  . 

X  can  be  characterized  in  many  ways  depending  on  the 
application  to  which  our  model  is  applied.  In  many  cases 
it  is  much  easier  to  find  boundary  points  of  convex  sets 


corresponding  to  a  given  normal  9f,  than  to  find  directly 
the  "lowest”  point  in  y  on  the  line  £.  In  the  context  of 
the  simplex  method  for  linear  programming,  this  simply 
expresses  the  fact  that  "pricing  out"  is  easier  than  directly 
finding  the  solution.  When  this  property  is  present,  it  is 
natural  to  generalize  the  simplex  method  to  solve  the  funda¬ 
mental  problem.  To  illustrate  the  basic  idea  we  initially 
restrict  ourselves  to  the  case  where  y,  in  addition  to  being 
closed  and  convex,  is  bounded.  We  assume  we  start  out  with 
points  P1,...,Pm+1  cy  which  uniquely  determine  a  hyperplane 
in  Em+1,  which  is  not  parallel  to  £,  and  such  that  some 
point  Q1e£  la  in  the  convex  hull  of  P1, . .  .,Ppw’i. 


We  then  move  the  hyperplane  determined  by  P1, . . . ,  P®+1 
in  the  direction  of  decreasing  PQ  until  we  find  the  correspond* 
ing  boundary  point,  Pm+^.  if  t*  Is  the  unique  vector 
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determined  by  ir1?1  =  tt 1P2  =  ...  =  7r1Pm+1>  then  finding 
Pm+2  can  be  expressed  in  the  following  way: 

(4)  ir1Pm+2  =  Min  [tt1  P/PeX). 

This  we  call  "pricing  out",  or  solving  the  "subproblem" 
in  various  contexts. 

The  next  step  is  to  solve  (2)  for  these  particular 

P^,  J  =  l,...,m+2  no  longer  allowing  them  to  vary  over 

y.  (2)  is  then  Just  a  linear  program  which  we  can  solve 

obtaining  an  optimal  solution  over  the  convex  hull  of 

P1, ...,Pm+2.  We  call  this  solution  Q2  and  the  correspond- 

2 

ing  dual  variables,  v  ,  which  are  determined  by  the  m 

1  m+2  2  ' 

points  of  P, .. ,,p  which  are  in  the  optimal  basis,  v 

is  then  used  in  the  "subproblem"  (4)  to  generate  a  new  < 

point  p”1^.  We  repeat  the  process,  until 
Min  (m*P/ Pex)  *  Min  (ir*PJ,  J  -  1, . .  .m+ *) .  * 

We  now  consider  three  of  the  assumptions  we  have  made 

so  far.  The  first  assumption,  and  most  important,  is  that 

# 

there  is  a  supporting  hyperplane  at  P  determined  by  v,|jl 

with  if  *  0.  That  is,  a  supporting  hyperplane  not  containing 

£.  If  X  is  a  polyhedral  set,  i.e.,  the  intersection  of  a 

finite  number  of  half-spaces,  we  have  no  trouble  since 

polyhedral  sets  are  "pointed"  in  the  sense  that  the  cone 

of  normals  to  supporting  hyperplanes  at  an  extreme  point 

has  a  non-empty  interior.  For  the  general  case  when  x  » 

is  not  necessarily  polyhedral,  we  assume  that  the  intersection 

* 

V - — - - - -  - - -  ■ 

Barr  has  developed  a  similar  technique  [la]  in  the  case 
v/here  (2)  is  a  quadratic  program. 
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of  K  and  ^  contains  a  point  in  the  relative  interior  of 
K  .  If  K  and  /.  have  this  property,  we  say  that  (1)  is  regular . * 
The  next  assumption  we  consider  is  that  we  can  find  the 
points  P1,  . ..,  P®41  to  start  our  algorithm.  If  K  has  a  non¬ 
empty  interior,  regularity  implies  these  points  exist.  If 
K  has  no  interior,  then  the  whole  problem  can  be  considered 
in  a  lower  dimensional  space  where  it  has  an  interior.  The 
practical  problem  of  finding  the  points  is  solved  by  using 
a  "phase  I"  type  procedure  as  in  linear  programming.  The 
"phase  I"  process  also  Indicates  the  dimensionality  of  K  . 

This  is  discussed  at  the  end  of  Chapter  II.  The  last 
assumption  we  discuss  Is  the  assumption  that  K  is  bounded. 

For  a  regular  problem  with  K  bounded,  the  algorithm  outlined 
above  was  shown  to  converge  to  solutions  of  (1)  and  (3) 
by  0.  B.  D&ntzlg  in  the  context  of  convex  programming 
[L.P.E.,  Ch.  24];  however,  in  order  to  include  most  al¬ 
gorithms  of  mathematical  programming  of  the  simplex-type 
in  this  format,  it  is  necessary  to  allow  K  to  be  unbounded. 

In  particular,  for  the  linear  program 
Maximize  Z 
subject  to  U q?  +  Ax  -  b 

x  >  0 

we  have  K  •  {?/?  ■  Ax  -  b,  x  >  0),  which  ia  a  cone  and 
patently  unbounded.  A  primary  objective  then  la  to  extend 
the  algorithm  to  the  unbounded  caae.  After  thia  is  done 
in  Chapter  II,  we  show  that  linear  programming,  convex 

*Much  of  the  duality  theory  used  here  has  also  been  developed 
by  T.  Rockefeller  using  the  theory  of  conjugate  functions 
[13a];  see  also  [17a].  Regularity,  aa  used  here,  la  related 
to  Rockafellar's  notion  of  a  stably  aet  problem. 
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programming,  and  the  decomposition  algorithm  of  mathematical 
programming  are  all  special  cases. 

Other  special  cases  are  problems  in  control  theory. 

In  Chapter  III  we  first  establish  a  very  general  class  of 
problems  which  can  be  formulated  as  special  cases  of  (l). 
Suppose  we  have  a  system  of  ordinary  differential  equations 
in  vector  form. 

(5)  -  f(x(t),u(t),t),  x(t)  *  (x0(t),...,xn(t))cEn+1 

ucfl,  u(t)cEr 
x(0)  e  X°  t  €  [0,«] . 

For  any  time  t  >  0,  we  can  define  St(x°)  to  be  the  set  of 
points  that  can  be  reached  at  time  t  by  the  solution  of 
(5)  for  some  point  x(0)  in  X°  and  u  in  the  class  of 
"admissible  controls"  ft. 

For  the  special  case  where  ft  «  (u  /  u(t) c Q(t),  u(t) 

measurable  and  bounded}  where  Q(t)  is  a  compact  convex  set 

for  all  t,  0  <£  t  £  T  and  as  a  function  of  t  is  upper  serai- 

continuous,  and  f(x,u,t)  is  given  by  f^(x,u,t)  ■ 
m 

2^  A^(t )  x j ( t )  +  u ( t ) ,  i  »  l,...,m  and  fo(x,u,t)  is 

convex  in  (x,u),  where  each  A^  is  continuous  on  0  £  t  £  T, 
we  show  that  the  set  s£(S°)  -  {x  -  (xQ, ...,xn)  / 

*0  *  *0>  *1  “*!>  ••  •’*!)  *  v  *  *  is  closed,  convex 

and  "bounded  below." 


For  the  optimal  control  problem  of  minimizing  Xq(T) 
over  solutions  of  (5)  which  satisfy  x^(T)  ■ 
i  «  l, .  ..,n  we  may  use  the  generalized  simplex  method  if 
the  above  conditions  are  satisfied  and 
X  *  (P/P  -  x  -  (0.x* . xnT  ),  x€S*(XQ))  is  regular. 

In  practice,  the  "pricing  out"  operation  is  by  no  means 
simple,  but  in  the  case  where  f(x,u,t)  ■  A(t)  x(t)  +  u(t) 
life  becomes  much  simpler.  We  close  the  thesis  by  out¬ 
lining  solution  methods  in  this  case.  One  approach  is  to 
discretize  the  problem  in  time  and  to  solve  the  corresponding 
linear  program.  This  is  expedited  by  the  use  of  generalized 
programming  and  a  new  algorithm  [  5 ]  which  makes  use  of  the 
special  structure  of  the  resulting  linear  program. 

In  Chapter  I,  the  simplex  method  as  applied  to  linear 
programming  is  outlined  in  its  algebraic  form,  mainly  to  get 
notation  straight.  We  then  give  the  formal  details  of  the 
generalized  programming  approach  of  Wolfe  and  Dantzlg  [L.P.E. 
Ch.  22].  The  details  and  relevant  proofs  ere  deferred  to 
Chapter  II.  This  generalized  simplex  method  forms  the  compu¬ 
tational  basis  of  this  work.  Finally,  another  variant  of 
the  simplex  method,  the  generalized  upper  bounding  technique 
[  4,5  ],  is  reviewed,  first  as  an  illustration  of  some  of 
the  devices  used  to  make  the  basic  simplex  method  more 
efficient  in  special  cases  and,  second,  because  of  its 
utility  in  the  numerical  solution  of  certain  types  of  linear 
optimal  control  problems. 
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In  Chapter  II  the  fundamental  problem  Is  introduced. 

We  then  prove  the  duality  theorem  for  this  problem  and  show 
how  the  generalized  simplex  method  applied  to  this  problem 
converges  to  the  optimal  solution.  We  then  show  that  the 
ordinary  simplex  method,  convex  programming,  generalized 
programming,  the  decomposition  algorithm  for  linear  pro¬ 
gramming,  and  many  problems  of  control  theory,  are  special 
cases  of  this  problem  and  interpret  the  generalized  simplex 
method  in  each  of  these  cases. 

In  Chapter  III  the  problem  of  optimal  control  Is  treated 
in  more  detail,  first,  in  the  spirit  of  Chapter  II,  and  then 
more  mundanely,  by  discretizing  the  problem  and  solving  the 
resulting  linear  programming  problem  using  the  generalized 
upper  bounding  technique. 

Finally  there  are  several  appendices  giving  necessary 
theorems  from  other  fields  of  mathematics  with  adequate 
references  to  proofs  If  they  are  not  given  in  the  text. 

A  final  note  cn  references.  Numbers  In  square  brackets, 

* 

[  ],  will  refer  to  the  list  of  references  at  the  end  of 
this  report.  The  special  symbol  [L.P.E.]  refers  to  0.  B. 
Dantzig's  Linear  Programming  and  Extensions  [3]. 

///  denotes  the  end  of  a  proof.  Theorems,  remarks,  and 
equations  are  all  numbered  serially  on  the  same  scale.  The 
number  referring  to  the  chapter  is  dropped  if  the  reference 
is  made  in  that  chapter. 
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Chapter  I:  Some  Ideas  In  the  Theory  of  Mathematical  Programming 


0.  Some  Notation:  Matrices  will  be  denoted  by  capital 
letters,  e.g.,  A.  The  1th  row  of  A  will  be  denoted  by  A^, 
the  column  by  A^  and  the  element  In  the  1th  row  and 
Jth  column  by  A^ .  The  components  of  vectors  are  distinguished 
by  subscripts.  Superscripts  on  vectors  represent  an  enumera¬ 
tion  of  a  collection  of  vectors.  Generally  speaking,  column 
vectors  will  be  denoted  by  Latin  letters  and  row  vectors, 
by  Greek.  The  norm  |  |  x  | |  of  a  vector  x  will  be  the 


euclidean  norm 


and  the  norm 


matrix  will  be  the  usual  one 


A 


of  a 


] |  A  ||*  sup  ||  Ax  ||  where  the  norms  in  the  right 

IMIsi 

are  euclidean.  A  symbol  of  the  form  (x/A)  denotes  the 
set  of  all  x  possessing  the  property  A. 
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1.  Linear  Programming  and  the  Simplex  Method:  The  funda 
mental  problem  of  linear  programming  is:. 


Maximize  xr 


Subject  to  A®  xq  +  Aq  x1  + 


+  aq  xn 


(1.1) 


A  ^  Xq  +  A  ^  x  ^  + 


+  A”  xn 


=  b- 


xo +  xi + 


•  •  +  A^  x 
m  n 


m 


and 


xn  |  •  •  •  |  X. 


n 


£  0. 


In  terms  of  the  columns  A*^  of  A(  »  [A^]  )  an  equivalent 
formulation  of  (1.1)  Is 
Maximize  xQ 
subject  to 


(i.2)  m  b 


Associated  with  the  fundamental  problem*  is  the  dual 
problem: 


*  In  this  context,  the  fundamental  problem  is  often 
referred  to  as  the  primal  problem. 
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Minimize  +  b-r.  +  •  •  .  +  br 

u  u  i±  mm 

subject  to  AqF0  +  A ^  +  •  •  •  +  A®irm  -  1 

(1.3)  *0*0  +  aJi-j  +  •  •  •  +  **%  i  0 

•  •  I 

•  •  • 

*0*0  +  AlTrl  +  *  *  *  +  A^irm  *  C 
or  equivalently 
Minimize  irb 

(1.4)  subject  to  irA0  ■  1 

tA^  i  0  J  -  1,  .  .  .  ,  n 

A  vector  x  *  x°  is  called  a  feasible  solution  of  (1.1) 
or  (1.2)  if  x®,  .  .  .  ,  x®  £  0  and  x^A^  -  b.  x° 

is  an  optimal  solution  if  x®  la  maximal  over  all  x  which 
are  feasible.  Similar  terminology  applies  to  the  dual 
program. 

(1.2)  and  (1. 4)  (or  equivalently  (1.1)  and  (1.3))  are 
related  by  the  following  theorems: 

1.5  Theorem  (Duality):  If  feasible  solutions  to  both 
(1.2)  and  (1.4)  exist,  then  the  value.  Xq,  corresponding 
to  any  feasible  solution  of  the  primal  (1.2)  is  less  than 
or  equal  to  the  value  yb  for  any  feasible  solution  of  the 
dual  (1.4).  Furthermore,  optimal  solutions  exist  for  both 
systems  and  they  are  equal  to  a  common  value,  l.e.. 

Max  x°  -  Min  Tb. 


1.6  Theorem  (Unbounded  Solutions):  (a)  If  the  primal 
(1.2)  has  a  feasible  solution  and  the  dual  (1.4)  does  not, 
then  xQ  J  lot  bounded  above  over  the  set  of  feasible  solu- 


tions  to  primal,  i.e.,  there  Is  no  optimal  solution 


solution  and  the  primal  (1.2)  does  not,  then  7rb  Is  not 
bounded  below  over  the  set  of  feasible  solutions  to  the 
dual . 


1.7  Theorem  (Optimality  Criterion):  If  x°  and  tt°  are 
feasible  solutions  to  the  primal  (1.2)  and  dual  (1.4)  systems 


respectively  and  If  we  let:  Aj  «  ttA*^  then  x°  and  tt° 
are  respectively  optimal  solutions  to  the  primal  and  dual 


The  proofs  of  these  theorems  can  be  found  In  Chapter 
6  of  [L.P.E. ] . 

The  most  common  method  for  determining  optimal  solu¬ 
tions  is  the  simplex  method  or  one  of  its  variants.  As 
a  review  and  in  order  to  build  up  a  consistent  notation, 
we  sketch  one  of  the  variants  here. 

We  assume  in  (1.2)  that  A  is  of  rank  m  +  1.  A  basis 
for(l.2)is  a  set  of  m  +  1  linearly  Independent  columns 
A0, A'*1,  .  .  .  ,A  m  which  we  will  require  to  contain  A®.  We 
usually  denote  a  basis  by  B  where  B°  -  fP  and  B1  ■  A^1, 


I 


i  *  1, ...,m.  A  basic  solution,  x  ,  of  (L. 2) is  a  solution 
obtained  from  a  basis  B  =  [A0, A*^, . . .  ,A^ra]  by  letting 


* J  “  0 

b  * 
\m  *i 


J  *  0 


J  *  J, 


i  =  1, . .  ,,m 


1  ;  •  •  ••  j  in  | 


where  x  is  the  unique  solution  of  Bx  »  b.  Note  that  xD 
is  not  necessarily  nonnegative.  A  basic  feasible  solution 
is  a  feasible  solution  which  is  basic. 

The  following  theorem  allows  us  to  consider  only  basic 
feasible  solutions: 

1.8  Theorem:  If  there  exists  an  optimal  solution  x°  to 


.21  then  there  exists  a  basic  optimal  solution  x1. 


Proof:  Let  x  be  any  optimal  solution  with  Xq  ■  x®  and 

let  J  ■=  (  J/Xj  >  0  }  .  Either 
(AJ)j£j  forms  a  linearly  independent  set  or  there  exists 

y  such  that  ^y^  ■  0  (y^  *  0,  J  £  J). 

If  the  first  occurs,  we  are  done;  suppose  in  fact,  the 

second  possibility  occurs.  If  OcJ,  clearly  y^  ■  0  for 

otherwise  letting  Z (A)  -  x  +  Ay;  Zq  could  be  made  greater 

than  xQ  by  appropriately  choosing  X  contradicting  optimality. 

Suppose  JncJ,  and  without  loss  of  generality  that  y.  <  0. 

J0 
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Then  Z(A)  must  eventually  have  one  less  non-zero  component 
than  x  as  X  Increases.  By  repeating  this  process  a  finite 
number  of  times,  the  columns  with  non-zero  coefficients  will 
eventually  form  a  linearly  Independent  set./// 

Let  us  suppose  we  have  an  Initial  basic  feasible 
solution  x°  and  the  corresponding  basis 
o  h  Jm 

B  *  [A  , A  ,  . . . ,  A  “]. 


B  Is,  by  definition,  non-singular,  so  we  may  assume  we 
also  have  B”1,  the  Inverse  of  B.  Let  the  "prices",  ir, 

1  fk  *1 

be  given  by  t  *  (B  )A,  i.e.,  the  0  row  of  B*  .  Then 

u  1 

0  "1 

we  have  irA  «  1  and  irA  «  0,  1  -  1, ...,m.  If 

tA**  £  0  for  all  J  we  have  by  (1.7)  optimal  solutions  to 

the  primal  and  dual  problems  and  we  are  done. 

If  not,  determine  s  by 

irA8  *  min  irA^  <0  J  «  1,  ...,n. 


We  now  wish  to  form  a  new  basis,  B',  by  replacing 
one  of  the  columns  of  the  old  basis  B  by  A8  In  such  a  way 
as  to  guarantee  that  the  new  basic  solution  x'  correspond¬ 
ing  to  B1  is  feasible. 


% 


t 
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What  we  are  doing  by  introducing  A8  into  the  basis  is 
increasing  the  value  of  x.  in  the  equation  Ax  -  b  maintain- 
ing  equality  by  adjusting  the  old  basic  variables.  Let 
0  denote  the  value  of  x  and  multiply  Ax  ■  b  on  the  left 
by  B”1  (and  assume  for  notational  simplicity  that 
B  -  [A0, A1,  ...,Am])  obtaining: 

(1.9)  *0U0  +  X1D1  +  •  •  '  +  xmUm  +  W1*”*1  +  •  •  •  + 

SB"1  A8  +  .  .  .  +  xnB“1An  -  B‘1b 

A.U 

Where  is  the  unit  vector  with  1  in  the  i  component. 

Then 


(1.10)  xjL  -  (B“1b)1  -  © (B~  1AS ) ±  i-0,...,ra. 

But  £  0  i  ■  1,  ...,m  must  be  satisfied  so  we 
can  only  Increase  6  up  to 


(1.11)  e 


min 

(b*1a8)1>0 


where 


(B‘V)r  >  0. 


J_ 

Since  x.  is  now  zero,  A  r  can  be  dropped  from  the 
Jr 

basis.  If  there  is  no  r  such  that  (b”*A8)  >0  it  can  be 

shown  that  Xq  is  unbounded  above  and  increasing  xfi  gives 

a  class  of  feasible  solutions  with  arbitrarily  large  values 

of  Xq.  In  texts  on  linear  programming  it  is  shown,  if 

suitable  care  is  taken  for  cases  where  e  •  0  in  (1.11),  that  this 
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process  either  yields  an  optimal  solution  or  a  class  of  feasible 
solutions  which  are  unbounded  above  in  ^  in  a  finite  num¬ 
ber  of  steps  [L.P.E.,  Chs.  6  and  10].  To  get  the  initial 
basis  the  "Pha&e  I"  procedure  is  used.  [L.P.E.,  Ch.  5] 

1.12  Remark:  It  can  be  shown  that  any  optimization  of  a 
linear  form  in  a  finite  number  of  variables  over  a  closed 
convex  polyhedral  set  can  be  expressed  as  a  problem  of  the 

form  1.2  and  be  solved  using  the  simplex  method,  per  example 
if  we  wish  to 

(1.13)  minimize  <yy  m  y 

J  J 

subject  to  ycK  a  convex  polyhedral  set,  we  first 
express  K  as  the  intersection  of  a  finite  number  of  half 
spaces  (A. 2. 7): 

£DiyJ  ^  di  1  -  1,  ...,m. 

We  then  define 

yj'yj  by  yj  "  yj  -  yj  y J,yj  >  0 

and  define  "slack  variables"  by 

si*di- 
Si  -  °‘ 

Since  maximizing  -yy,  is  equivalent  to  minimizing  yy 
(1.17)  Max  Xq 


r 


1 


2.  Generalized  Programming:  A  very  fruitful  generaliza¬ 
tion  of  the  fundamental  problem  la  the  following  problem 
(due  to  P.  Wolfe): 

(2.1)  max .  xn 


s.t . 


XjA*^  «  b  where  each  A*^  may  be  chosen 

J»0 


freely  fhm  *  0),  a  closed  convex  subset  of  m  +  1  dimen¬ 
sional  euclidean  space  E”*1. 

Formally  we  "solve"  (2.1)  by  first  choosing  one  representa¬ 
tive  A°*J  of  each  set  (fj  ^  and  forming  the  corresponding 
fundamental  linear  program.  Using  the  simplex  algorithm, 
an  optimal  basic  solution  and  the  corresponding  optimal 
dual  solution  ir°  are  obtained.  We  next  solve  the  n  sub- 


sroblems : 


(2.2) 


min  ir°AJ 


s.t.  AJ€| 


J  *  1|  •  •  • 


To  solve  (2.2),  let 
(2.3)  A.  -  min  ir°A^ 


A.  -  min  tt^A1^  and  A  -  min  min 

3  J  j  or  j 


and  suppose  As  -  A.  If  A  £  0  we  stop  and  if  A  <  0 
s  Os 

we  add  A'  (where  tt  A'  ■  A)  to  the  master  program  and 
reoptimize  obtaining  a  new  v  -  t*.  We  then  repeat  the 
process.  We  consider  this  generalized  algorithm  in  more 


s 


detail  in  the  next  chapter,  here  we  merely  indicate  the 

Justification  of  this  formal  procedure  in  the  Important 

case  where  the  dj j  are  polyhedral  sets.  In  this  case  (2.2) 

is  a  linear  program  according  to  1.12.  Hie  simplex  method 

applied  to  the  Jth- subproblem  yields  either  an  extreme 

point  as  a  minimum;  or  a  class  of  vectors  of  the 

form  A*^  +  with  tV  <  0  and  +  AH^cOfj  for 

all  A  £  o  and  all  In  the  former  case,  we  add  A** 

and  in  the  latter,  as  additional  columns  to  (2.1), 

referred  to  as  the  master  program.  „The  and  are 

extreme  points  and  rays  respectively  and  there  are  only  a 

finite  number  of  them  (up  to  a  normalization  of  the  H^) . 

Since  the  simplex  method  is  finite  and  the  number  of  columns 

which  can  be  added  are  finite,  the  algorithm  for  the  gener- 

* 

alized  program  with  polyhedral  coefficient  sets  is  finite. 

In  the  general  case,  the  algorithm  is  not  finite,  but  under 
very  general  conditions  converges,  which  we  will  show  later 
in  Chapter  II.  Now  we  turn  to  a  variant  of  the  simplex 
method  which  will  prove  useful  in  Chapter  III. 

3.  Generalized  Upper  Bounding  Techniques:  We  shall  have 
need  to  apply  the  simplex  method  in  a  shrewd  way  to  special 
linear  programs  consisting  of  M  +  L  equations  such  that  each 
variable  has  at  most  one  non-zero  coefficient  in  the  last 
L  equations,  and  the  last  L  equations  are  not  homogeneous, 
l.e.,  the  last  L  components  of  the  right  hand  side  are 
all  non-zero.  Dantzlg  suggested  Informally  two 
efficient  ways  of  attacking  such 

Care  ha3  to  be  taken  in  thie  use  of  this  procedure,  ,/e  give 
the  precise  results  in  Section  II. 5. 
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problems,  the  one  given  here  is  set  out  in  more  detail  in 
[  5  ],  the  other  is  described  in  [  4  ].  By  normalizing 

the  variables  and  multiplying  the  equations  by  constants 
we  can  assume  without  loss  of  generality  that  all  the  coef¬ 
ficients  in  the  last  L  equations  are  t  1.  Moreover,  for 
simplicity  in  describing  the  algorithm,  we  assume  that 
these  coefficients  are  all  +  1.  The  obvious  modifications 
required  to  handle  -1  coefficients  will  be  indicated  later. 

The  Ith  set  of  variables  or  columns,  S£  ,  will  refer  (depend 
ing  on  context)  to  those  variables  or  columns  corresponding 
to  the  columns  of  coefficients  with  1  as  their  M  +  i 
component.  So,  the  zero  set,  is  the  set  corresponding  to 
columns  with  zero's  for  the  M  +  l8t  to  M  +  Lth  coefficients. 

We  assume  that  the  system  is  of  full  rank  and  denote 

by  [A**1, . .  a  basis  for  the  system.  We  always  assume 

Ji  o 

that  A  «  A  ,  where  xQ  is  the  variable  to  be  maximized. 

The  underscoring  is  to  differentiate  coefficient 
vectors  with  all  M  +  L  components  from  the  reduced  vectors 
consisting  of  the  first  M  coefficients  which  will  be  denoted 

0 

without  the  underscoring.  There  will  be  no  underscoring 
for  individual  coefficients  A^  of  the  two  different  types  of 
vectors  since  they  differ  only  in  the  number  of  components. 

The  proofs  of  the  following  obvious  theorems  are  in  [  4  ]. 

3.1  Theorem:  At  least  one  variable  from  each  set  is 

basic . 
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3.2  Theorem:  The  number  of  sets  containing  two  or  more 

basic  variables  Is  at  moat  M. 

The  sets  containing  two  or  more  basic  variables  plus 

# 

SQ  are  called  essential  sets.  Given  a  feasible  basis. 


Obtaining  a  first  feasible  solution  is  accomplished  with 
a  phase  I  procedure  as  in  the  usual  simplex  method. 

we  assume  we  have  selected  for  each  one  basic  variable 
to  be  the  key  variable  for  that  set.  We  then  consider 

*1 

the  modified  system  (3.3)  obtained  by  subtracting,  for  each 
set,  the  key  column  frcm  every  other  column  in  its  set  (in  (3.3) 
assuming  for  simplicity  that  the  key  variable  was  the  first 
column  in  each  set).  In  this  modified  system,  the  value 
of  the  key  variables  must  be  one.  We  then  subtract  the 
coefficients  of  the  key  variables  from  the  right  hand  side 
as  we  would  variables  at  upper  bound  in  an  upper-bounded 
variables  algorithm  [L.P.E.,  Ch.  18]  for  the  simplex  method. 

We 
if 

(3.4) 


then  introduce  the  following  notation 

A'USj  we  let 

4#  4 1 

D  1  -  A  1 

-  A^  -  J  W 


and 


<,,*'*M*  '*w.~-  ;***<  r>»^v 
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o  -  »  -  £»* 


- I'1- 


are 


Moreover,  the  last  L  components  of  b  -  ^  A  * 
all  zero. 

Al 

Since  the  D  are  Incorporated  in  d,  they  no  longer 
appear  explicitly  in  our  system.  The  working  basis,  B, 
is  given  by  B  *  [D^/k^  is  basic  and  not  key} .  By  virtue 
of  theorem  3.1,  it  is  clear  that  B  has  exactly  M  columns. 
We  let  B1  *  A0  so  that  the  first  column  of  B  corresponds 
to  the  coefficient  of  the  variable  to  be  optimized.  We 
define  the  derived  system  to  be 


(3.5) 


Ty/  -  < 

AJ  not  key 


and  it  is  now  easy  to  prove: 

3.6  Theorem:  B  1b  a  basis  for  (3.5). 

Proof:  Suppose  ^  AjB^  -  0,  A  +  0. 

Since  B^  differs  from  B*^  only  by  zero  components  we  have 

y  AjB^  *  0.  But  this  implies  that  the  B^  plus  the  key 

columns  are  linearly  dependent  since  the  B^  by  themselves 
are.  On  the  other  hand,  this  set  is  obtained  from  a  (linearly 
independent)  basis  by  subtraction  of  columns  from  within 


i 


i 

f, 

5  1 

i  . 

4 


5 
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the  set  which  does  not  reduce  the  rank,  yielding  a  contra¬ 
diction./// 

By  theorem  3.2  there  exist  at  most  M  seta  with  more 
than  one  basic  variable.  These  sets  and  Sq  are  the  only 
sets  which  contain  members  of  B,  i.e.,  they  are  the  essential 
sets.  Thus,  with  each  feasible  basis  for  the  original 
system,  we  have  associated  a  set  of  L  key  variables  and  a 
basis  for  the  derived  system  (3.5)*  We  now  show  that  we 
can  carry  out  the  steps  of  the  simplex  method  using  Just 
the  inverse,  B"1,  of  B,  the  reduced  basis,  and  the  correspond¬ 
ing  basic  solution,  y  -  B"*d,  of  (3.5). 

The  first  step  is  to  obtain  a  set  of  prices  for  the 
original  problem.  Let  us  denote  by  ir  »  (Tj,...,irM)  the 
prices  cn  the  first  M  equations  and  \i  «  (p,^, . . . , M^)  the 
prices  on  the  last  L.  These  prices  are  determined  uniquely 
by  the  condition  that 

(ir,p)  A0  -  (t,h)  a'1  -  1 

J, 

(t,p)  A  1  -  0  1  -  2,...,M  +  L. 

ft  ■  (B_1)1  has  the  property  that  ft  B1  -  ft  A0  «  1, 

and  ft  B1^  «  0  J  ■  2,  ...,M,  i.e.,  ft  is  a  set  of  prices 

for  (3.5).  To  extend  ft  to  a  set  of  prices  for  our  original 
problem,  we  merely  set 

(3.7)  “  "ft  A  *  i  -  1, 


1  1 


<■  '  «*•>*■»•*■ 
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Because 

(1S$)  A  1  ■  (tt,?jC)  A  *  -  0  if  A  *  is  key  by  (3,5)  or 
(ir, ft)  A  1  -  (ir,ft)  (B1  +  AA)  for  some  A*  key,  A^1  not  key 
-  (*,*)  B1  +  A* 

“O+O. 

Thus  (ir,$)  does  form  a  set  of  prices  for  the  original  system. 

Using  these  prices  we  can  "price  out"  the  columns  of 
the  original  system  to  find  one  to  enter  into  the  basis. 
Using  the  usual  simplex  criterion,  the  Incoming  column  A8 
would  be  chosen  by 
A„  «  min  A, 


s 


where 


J 


Aj  •  (ir,u)A^  -  ^  iriAi  +  ^1  for  A^€SI 

If  Aq  ^  0,  we  have  an  optimal  basic  feasible  solution  and 
we  are  done;  otherwise  wr  bring  A8  into  the  basis  (assume 
AacS0)  .  To  do  this,  we  mue",  express  A8  and  b  in  terms  of 
the  current  basis  for  the  original  system.  If  we  let 

_  Tl”1/*8  _  A  AQ 


then 


(3.8)  (A8  -  A*°) 


b’ad  -  B"a(Ab  -  A  a) 
M 


-  £  ^b1  ■  l  3  - *  *: 


where  indicates  the  column  number  in  (3.3)  corresponding 


th 


to  the  i  column  of  the  working  basis  and  denotes  the 


column  number  of  the  corresponding  key  variable. 

We  denote  the  representation  of  A8  in  terms  of  the 
current  basis  by  A8,  that  Is 


M+L 


1-1 


Prom  (3.8)  we  see  that 


1  -  ^D8  If  A^1  -  A*a 

Vt^ 


(3-9) 


_a  ^1  *lfc 

D®  If  A  *  A  for  some  t 

-  V  D®  If  A  1  -  A  ^  for  some  vt  y  A 

V4  0 

0  otherwise. 


The  current  values  for  the  variables  in  the  basis, 
are  found  in  a  similar  way.  That  Is,  let  d  -  d^, ...,dM 
be  given  by 

(3.10)  d  -  B"1  (b  -  J\Ai)  -  B-1d 
then 

(»  -  £***)  -  ‘  (a"1  *  A*1) 

and  as  in  (3.9)  the  b^  are  given  by 


(3.11) 


if  A  1  is  key 
^1  ^t 

A  -A  for  some  t 


0  otherwise 


4 


Finding  the  variable  to  leave  the  basis  is  accomplished 
in  exactly  the  same  way  as  in  the  ordinary  simplex  method. 


(3-12)  6  ■  f  ■  T  A 

*r  *i  >  0  *1 

where  we  of  course  require  that  >  0.  Let  us  assume  that 

Jr 

A  eSp.  Then  three  cases  can  occur  in  the  updating  process: 

Jr 

(a)  If  A  eSa,  the  outgoing  variable ,  Is  the  key  variable 
In  S  a,  then  B  remains  unchanged,  and  As  simply  replaces 
^r 

A  as  the  key  variable  In  S^.  This  requires  the  updating 
of  d  which  is  accomplished  as  follows: 


3  :•  B_1  (b  -  ^  +  ’  *')# 


Where  the  symbol  does  not  Indicate  equality  but  rather 
that  the  expression  on  the  right  replaces  (updates)  the 
variable  on  the  left. 


(3.13) 


B‘1  (*  -  j/‘  -  A)  *  B'1  ^ 


-  a  -  b’1  (a8  -  xT) 

-  a  -  c8 


Jr 

Observing  that  A  -  A  ,  this  is  easy  to  compute  since 
we  already  have  d,  and  fi8  was  already  generated  to  determine 
the  Xj. 


« 


30 


WY*  1 

(b)  If  A  ia  not  a  key  variable,  then  we  update  B” 

J 

simply  by  pivoting  on  the  column  T5B  on  the  row  which  A  r  - 

A0  inverse  of  the  n 

A  v  occupies  in  the /^working  basis.  In  symbols  B  :«  PB 

where  P  is  the  matrix  which  performs  the  pivot,  d  is  updated 

by  applying  P  to  the  old  d. 

(c)  If  p  *  o  and  A^r  is  key,  we  must  first  change  the 

key  variable  in  Sp  from  A  ,  this  changes  the  working  basis 

and  its  inverse.  To  do  this,  we  consider  all  columns  of  B 

corresponding  to  the  set  Sp,  there  must  be  at  least  one 

such,  since  after  As  enters  the  basis  Sp  must  contain  a 

basic  variable  (Theorem  3.1).  One  of  these  columns,  call 

it  A*,  is  to  become  the  key  variable.  To  obtain  the  new 

working  basis,  B,  from  the  old  one,  B,  we  multiply  the 

column  A*  -  A^r  in  B  by  -1  to  obtain  A^r  -  A*  and  then 
ik  J_ 

subtract  A  -  A  r  from  every  other  column  of  the  form 
A*J  -  A^r  for  J  «  *  to  obtain  A^  -  A*.  That  is,  B  :■  BT 
where  T  is  of  the  form 

1 

*  1 

0  •  •  *0  *1  •  •  •  *1  "1  *li  •  •  *1  0  •  •  •  0 

1 

•  1 

where  the  -l(s  occur  In  the  columns  corresponding  to  Sp. 

B”1  :■  T"^”1  and  it  is  easily  verified  that  T~*  -  T  since 
applying  the  process  twice  replaces  A  r  as  the  key  variable. 


(3.14) 
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The  values  for  “3  are  updated  by  applying  T”1.  How  with 
the  new  key  variable  In  S we  simply  go  through  step  (b). 
With  our  updated  B”1,  d  and  key  variables  we  are  now  ready 
to  make  another  Iteration. 

When  -l*s  appear  In  the  last  M  +  L  equations,  the 
algorithm  can  be  changed  In  a  quite  obvious  way.  Theorems 
3.1,  3.2,  and  3.6  are  still  valid  and  we  can  require  that 
each  key  variable  have  a  +1  In  its  last  L  components  since 
clearly  each  set  must  have  such  a  column  which  Is  basic. 

In  the  pricing  process,  if  Is  to  be  priced  and  has  a 
negative  one  in  one  of  the  last  L  equations,  the  appropriate 
U  Is  subtracted  rather  than  added  to  rA^.  To  form  the  dif¬ 
ference  columns,  D^,  the  key  column  is  added  rather  than 
subtracted  from  columns  with  a  -1,  and  appropriate  modifi¬ 
cations  in  (3.9)  and  (3.11)  must  reflect  this.  Otherwise 
the  algorithm  remains  unchanged. 
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Chapter  II.  A  geometric  Theory  of  Optimization 

1.  The  fundamental  problem:  Most  of  the  material  in  this 
chapter  is  based  on  the  following  problem. 

(1.1)  Max .  Z 

s.t.  UQZ  +  P  «=  0 

where  UQ  is  the  unit  vector  (1,0,0, ... ,0),  P  may  be  chosen 
arbitrarily  from  X#  a  closed  convex  set  in  m+  1  euclidean 
space,  and  Z  is  a  scalar  variable.  This  problem,  by  the 
convexity  of  X,  is  trivially  equivalent  to 

(1.2)  Max.  Z 

n 

8.t.  u0z+£*jFJ-° 

J-l 

• 1 

(v,  *  o) 

P“eX  J  ■  1, .,.,n 

where  n  can  be  taken  arbitrarily  large.  Formally,  we  can 
take  the  dual  of  (1.2)  obtaining: 

(1.3)  Min.  m. 

S.t.  TTUq  ■  TTq  ■  1 

tP  +  n  £  0  for  all  P €X. 


a 
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We  want  to  establish  duality  relations  between  (1.1) 
and  (1.3)  modeled  on,  the  results  of  theorem  1.1.5  and  1.1.6 

i 

!  ,  in  linear  programming.  In  particular:  if  feasible  solu¬ 

tions  exist  to  both  (1.1)  and  (1.3)  then  min  p,  *  max  Z; 

, 

I  if  feasible  solutions  exist  for  one  problem  and  not  the 

other,  then  the  solutions  are  unbounded  in  the  direction 
of  extremization;  and  if  an  optimal  solution  exists  to 
one,  an  optimal  solution  exists  for  the  other  and  min  p.  = 
max  Z.  Unfortunately  all  this  cannot  be  accomplished,  and 
the  principal  obstruction  is  the  problem  of  the  non- zero 
multiplier,  which  raises  havoc  throughout  the  theory  of 
optimization.  It  appears  at  a  crucial  Juncture  in  proving 
the  "equivalence"  of  matrix  games  and  linear  programming; 
as  we  shall  see  later.  It  Is  a  motivation  for  constraint 
qualifications  In  non-linear  programming;  It  appears  In 
the  question  of  normality  In  the  calculus  of  variations;  and 
of  course  In  the  theory  of  optimal  control.  To  make  clear 
what  Is  Involved,  we  consider  a  very  simple  geometric 
Interpretation  of  (1.1),  (1.2)  and  (1.3). 

If  we  let  £  -  “  p2  *  •••  ■  pm  *  0,  and 

P0  arbitrary)  then  problem  (1.1)  may  be  lnberpretsd  as  finding  the 
"lowest"  point  (-Z,0,...,0)  common  to  £  and  the  convex  set 
X  (Figure  1.4). 


Figure  1 . 4 

Since  X  is  closed,  if  such  a  "lowest”  point  exists  it  will 
be  a  boundary  point  of  X  and  hence  belongs  to  a  supporting 
hyperplane  of  X,  determined,  say,  by  ttP  +  ft  »  0.  Since  the 


representation  of  the  supporting  hyperplane  is  homogeneous 
in  the  variables  ft  and  $  we  can  multiply  'rr  and  ft  by  a  common 
non-zero  multiplier  and  get  an  equivalent  supporting  hyper¬ 
plane.  By  the  outrageously  prejudiced  choice  of  notation 
in  (1.3)  and  Figure  1.4,  it  Is  clear  that  we  wish  to  convert 
if  and  ft  into  a  solution  of  (1.3),  and  generally  we  can. 
Unfortunately,  Figure  1.4  has  prejudiced  the  issue  in  another 
way,  the  "hooker"  comes  in  situations  as  pictured  in  Figure  1.5. 


(a) 


r 


Figure  1.5 
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In  order  to  satisfy  the  constraint  ttXJq  «  ir^  -  1  in  1.3  by 

tt,  we  must  perform  the  obvious  normalization  t  «  —  if, 

*0 

p.  =  ft,  but  in  1.5  (a)  the  only  supporting  hyperplane 

has  iTq  «  0.  On  the  other  hand.  Figure  (b)  illustrates 
a  case  where  there  is  a  supporting  hyperplane,  but  none  such 
that  tFq  *  0,  and  Z  is  unbounded  above  (-Z  unbounded  below). 

In  the  "nice"  case.  Figure  1.4,  it  is  essy  to  see  we  can 
interprete  the  dual  problem  (1.3)  as  the  problem  of  finding 
the  supporting  hyperplane  "underneath"  X  with  the  highest 
intercept  with  £.  By  "underneath"  we  mean  that  for  all  PcX, 
tP  +  p  £  0,  and  the  highest  Intercept  is  p.  The  reason  we 
insist  on  the  "non-zero  multiplier"  Tq  is  that  while  it  is 
necessary  that  each  optimal  solution  of  (1.1)  belongs  to  a 
supporting  hyperplane,  it  is  also  sufficient  for  a  feasible 
solution  to  (L.)  to  be  optimal  if  it  belongs  to  a  "non- 
vertical"  supporting  hyperplane,  l.e.,  one  in  which  Tq  >  0. 

In  cases  such  as  1.5  (a,b)  we  cannot  resolve  the  question 
by  using  the  dual.  In  particular,  in  Figure  1.5  (a)  the 
only  supporting  hyperplane  for  the  optimal  feasible  solution 
is  vertical,  but  there  are  also  other  feasible  solutions 
belonging  to  the  same  supporting  hyperplane.  So  the  hyper¬ 
plane  does  not  uniquely  determine  the  optimal  feasible  solu¬ 
tion,  as  it  does  in  cases  where  Tq  >0. 

If  the  set  X  is  polyhedral,  as  in  linear  programming, 
life  is  easier.  As  indicated  schematically  in  Figure  1.6, 
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2  2 

if  there  is  a  vertical  supporting  hyperplane  (tt  ,4  )  at 
the  optimum  for  (1.1),  there  is  also  a  non-vertical  one, 
(ir1, M-1 )  and,  as  a  matter  of  fact,  a  whole  manifold  of 
supporting  hyperplanes. 


This  is  a  result  of  the  "pointedness"  of  convex  polyhedral 
sets  (A. 2. 13).  We  turn  now  from  this  somewhat  heuristic 
motivation  to  a  more  rigorous  development  of  these  ideas. 
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2.  A  duality  theorem  for  the  fundamental  problem:  Before 
we  prove  duality  theorems  for  (1.1)  and  (1.3)  we  make  some 
definitions . 

2.1  Definition;  A  set  yc  Em+1  la  said  to  be  bounded  below 
if  there  exists  tt,|x  such  that  TcEn+^l  Tq  ■  1  and  trP  +  |i  ^  0, 
for  all  Pex. 

2.2  Remark :  If  we  interpret  the  plus  Xq  direction  as 
"up",  y  being  "bounded  below"  simply  means  there  is  a  non¬ 
vertical  hyperplane  ( Tq  «  1)  which  is  "below"  all  points 
y,  i.e.,  if 

(2.3)  P°  -  (po'...,Pm)  8atl8fi®8  *P°  +4-0  and 

If  P1  -  (pJ,p°,p|,...,P°)€X,  th.n  P°  t  p°. 

Requiring  y  to  be  bounded  below  In  (1.1)  Is  then  equivalent 
to  requiring  that  a  feasible  solution  to  the  dual  (1.3)  exists. 
2.4  Definition:  We  say  that  a  closed  convex  set,  Xc. e”14  is 
regular  if  the  line  Z  -  {(PQ, . .  ■  P2  *  . ..  -  Pm  -  0) 

intersects  the  relative  interior  of  X>* 


See  Appendix  A  for  material  on  affine  geometry  and  the 
dimension  of  convex  sets.  In  (15a)  and  (17a)  the  results  of 
this  chapter  have  been  considerably  generalised. _ 

2.5  Remarks :  Intuitively  it  is 'clear  that  'if  Z  inter¬ 
sects  the  relative  interior  of  X»  there  exists  a  non-vertical 
supporting  hyperplane  at  the  lowest  point  of  Z  in  X* 
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Unfortunately,  the  property  of  being  regular  is  not  invariant 
under  affine  transformations.  In  this  sense,  being  regular 
is  not  a  characteristic  of  K,  but  of  K  together  with  the 
line  jS  .  In  the  case  of  polygonal  sets,  regularity  is  not 
needed  as  indicated  in  section  1  (A. 2. 13). 

To  prove  our  duality  theorem,  we  first  need  some  general 
results  from  the  theory  of  convex  sets. 

2.6  Theorem  (Bounding  hyperplane):  Let  K  be  a  convex  set 

belonging  to  a  linear  space  and  let  P°  be  any  point 

belonging  to  the  boundary  of  K  ,  relative  to  d ,  then  there 
exists  a  non-trlvlal  linear  homogeneous  functional,  f ,  on  ^ 
and  scalar  a  such  that  f(P)+  a  »0,  P  e  K  and  f(P°)  +  a  -  0. 
Proof :  See  (a.2.18). 

2.7  Corollary :  Let  K  be  a  closed  convex  cone  with  vertex 
at  the  origin  belonging  to  a  linear  subspace Jtjc.  e”*1  and  let 
P°  /  0  be  any  boundary  point  of  K  .  Then  there  exists  a 
non-trlvlal  linear  functional,  f,  on  such  that  f(P)  *  0, 
for  all  P zK  ,  and  f(P°)  -  0. 

Proof:  Since  Oe  K  ,  if  we  apply  2.6  to  K  considered  as  a 
convex  set,  we  have  f  and  a  such  that  f(P)+  a  >0  for 
all  ?t  K  .  f  (0)  -  0  implies  a  >  0.  Suppose  a  >  0.  By 
Theorem  2.6  f  (P°)  ■  -  a  <  0  and  by  linearity  and  the  fact 
that  K  is  a  cone,  xP°e  K  and  f(XP°)  -  xf(P®)  -  -  \a  for  all 
X  >  0.  For  x  >  1  f(xP°)  ■  - Xa  <  -a  contradiction./// 

2.8  Corollary:  If  P^  belongs  to  the  relative  boundary  of 
K  in  Theorem  2.6,  there  exists  f,  a  satisfying 
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f  (p°)  +  a  ■  o,  f  (p)  ♦  a  >  0  for  all  Pc  K  ,  such  that 
f(p)+q  >0  for  all  P  In  the  relative  Interior  of  K  . 

The  same  relations  hold  for  Theorem  2.7  with  a»0. 

Proof:  Let  */-  {Q-P-P^Pc^}  where  if  is  the  linear  variety 
of  minimal  dimension  containing  K  .*  4/ is  then  a  vector 

# 

See  Appendix  A  for  discussion  of  linear  varieties  of 
minimal  dimension,  affine  dimension  (a.d.),  and  other 
material  on  affine  geometry. 

space  of  dimension  k  a.d.(K)  with  0  as  a  boundary  point 
of  the  set 

K°-K  -  {P°)-{Q-P  -  ?°/?tK) 

Applying  Theorem  2.6  to  K°  in  d  ,  we  obtain  f  satisfying 
QcK  where  f  is  a  linear  (homogeneous)  functional 
defined  on  d  and  not  identically  zero  there.  Furthermore, 
f  cannot  be  0  everywhere  on  K  for  if  it  were,  c<R  - 
{Q/f  (Q)  -  0},  and  &(\d  since  f  0  on  d  ,  is  a  linear  space 
of  dimension  <  k  which  contradicts  the  definition  of  V \ 

Let  Q'c K°  be  any  point  such  that  f(Q')  >0.  Let  Q  be  an 
arbitrary  point  in  the  relative  Interior  of  K°  and  suppose 
f  (Q)  »  0.  Since  Q  is  In  the  relative  interior  of 
Q  -  x(Q'-Q)  is  In  the  relative  Interior  of  for  some  x  >0 
sufficiently  small.  But  by  linearity  0  <  f  (Q-x(Q'-Q))  - 
(l-x)f(Q)  -  xf(Q')  <0  which  Is  a  contradiction.  Hence  f(Q)  >0 
for  all  Qc  relative  interior  of  K°.  He  now  extend  the 
domain  of  f  to  all  of  E®.  This  can  easily  be  done.  Let 
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V  be  the  orthogonal  complement  of  JL.  Then 
e"1  =  !©*>=  (S  +  W/  SeJ,  WelJ) .  For  any  PeE111  we  define 
?(P)  =  ?(S)  where  P  =  S  +  W  for  some  SeJ,  Now  we 

define  f  by  f(P)  =  ?(P)  -  ?(P°).  Then  for  any  PeX, 

^(P  -  P^)  ^  0  which  implies  that 

f(P)  =  f(P  -  P°)  +  f(P°)  =  f(P  -  P°)  -  f(P°)  +  ?(p°)  -  ?(p°) 
?(P  -  P°)  -  ?(P°)  ^  -?(P°).  Strict  inequality  holding  for  P 
in  the  relative  interior  of  X»  Letting  a  =  ?(P°)  we  are 
done ./// 

Finally  we  come  to 

2.9  Theorem  (Duality):  If  in  (1.1)  and  (1.3)  X  is  bounded 
below  and  is  regular,  (l.l)  and  (1.3)  both  have  optimal 
solution^  and  Min  u  -  Max  z. 

Proof:  Consider  c  =  £nX-  By  the  hypothesis  of  regularity, 

C  is  non-empty,  and  by  virtue  of  X  being  bounded  below, 

* 

and  y, jc  closed  there  exists  a  P  on  C  with  the  smallest 

* 

initial  co-ordinate.  Clearly  P  is  on  the  boundary  of  X 
so  by  the  bounding  hyperplane  theorem  2.$  there  exists 
t r,  a  such  that 

(2.10)  ttP*  =  a,  and  ttP  =  a  for  all  TeJ( 

Next  we  must  show  that  there  exists  ir,  a  satisfying 

(2.10)  and  the  additional  condition  7Tq  =  1.  Let  if  be  the 

linear  variety  of  lowest  dimension  containing  X  and  let  k 

be  its  dimension.  Two  cases  can  occur,  depending  on  whether 
■* 

P  belongs  to  the  relative  interior  of  X  or  not. 
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# 

Case  I:  If  P  belongs  to  the  relative  interior  of  K,  it 

* 

follows,  because  P  is  a  boundary  point  of  K,  that  k  <  m+1 

and  that^tf  v'.  There  is  a  1-1  correspondence  between 

linear  m-manlfolds  in  Em+*  and  relations  wy  +  a  -  0(»  ^0) . 

Let*/, {x/ir'x  +a 1  -0>  be  any  m-manifold  containing  I/* 

but  not  then  4  0.  Let  v  --1,-  ,  o  -  a  ' . 

u  *0  0 

* 

Case  II;  If  P  does  not  belong  to  the  relative  interior 
of  K,  there  exists  by  regularity  a  relative  interior  point 
P°eC  and  by  2.8  a  *  such  that  ir(P°-P#)>  0.  But  for  any  PeC  , 

?1  -  . . . «  Pffl -0  hence  v  (P°-P*)  -  ir0(po°"P0*)  ?  0  l»Plyin6  that 

*  1  ■  ;p* 

irn  >  0.  Again  we  let  *  -  -y-  *  ,  and  a  -  — . 

U  *Q 

This  establishes  a  feasible  solution  to  1.3  by  taking 
y  -a  .  All  that  remains  is  to  show  that  this  solution  is 
optimal.  Multiplying  the  two  equations  of  (1.1)  by  it  and 
u  rerpectively  and  adding  yields 
tUqZ  +  irp  -  0  or 

Z  +  (*p  +  u)  -  y  and  since 

*p+  y  >0  we  have  Z  <  y  for  any  feasible  solutions  to  (1.1) 
and  (1.3)*  Since  the  particular  values  of  z  and  u  for  the 
solutions  we  have  constructed  satisfy  z  mv  both  are  optimal./// 
2.11  Remark ;  The  bounded-below  assumption  simply  implies 
the  existence  of  a  feasible  solution  to  the  dual;  the 
regularity  condition  among  other  things  implies  the  existence 
of  a  feasible  solution  to  the  primal.  Dropping  these 
assumptions  we  now  attempt  to  imitate  the  unbounded  solu¬ 
tions  theorem  of  linear  programming  (1.1.6). 
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Ejgaggfl 


may  occur: 


(a)  (1.1)  and  (1.3)  both  admit  feasible  solutions. 

(b)  (1.1)  admits  feasible  solutions  but  (1.3)  does  not. 

(c)  (1.3)  admits  feasible  solutions  but  (1.1)  does  not. 

(d)  Neither  (1.3)  nor  (1.1)  admits  feasible  solutions. 

Proof:  ? 


(b)  X  equals  the  whole 
space 


2.13  Theorem:  If  X  Is  closed  and  convex  exactly  one  of 
the  following  occurs*: 

(a)  (1.1)  and  (1.3)  are  both  feasible  and  Max  Z  *  Inf 


Further,  If  X  Is  regular,  Inf  n  -  Min  u  »  Max  Z. 

(b)  (1.1)  Is  feasible  and  (1.3)  is  not  and  Sup  Z  *  +  «. 


(c)  (1.3)  Is  feasible  and  (1.1)  is  not,  and  Inf  m-  ■  -  ». 


(d)  Neither  (1.1)  nor  (1.3)  Is  feasible. 


This  theorem  Is  generalized  in  [17a]  . 
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Proof :  (a)  If  A  is  regular ,  the  result  follows  from  Theorem 

2.9.  Suppose  A  is  not  regular.  As  in  the  proof  of  2.9* 

there  is  an  optimal  solution  to  (1.1)*  call  it  P*. 

# 

Let  e  >  0  be  arbitrary,  and  let  P  ■  (PQ  -c,0,  . ..,  0). 

«  •  ** 
p  i  K  ,  hence  by  (A. 2. 17)  there  exists  *,<*  such  that 

wp  +  o  <  0,  and  trP  +  a  >0  for  all  PeK,  in  particular 

wP  +  a  >  0.  But  »p  -  WqPq  "  w0^p0  “*)*  *  Thl8 

implies  that  nQ  >  0;  we  assume  without  loss  of  generality 

m  ** 

that  »q  ■  1 ,  Identifying  a  with  y  in  (1.3)  we  see  that 

----  -  *  *  -  * 

»P  ■  wqPq  “  Pq  “  Pq  -e  <  -  a  ■  -y  ■  Pq  ■  *P  .  Since  e  is 

* 

arbitrary  Inf  y  -  -Pq  -  Max  Z. 

(b)  If  Sup  Z  <  +»,  there  exists  P  -  (Pq,0,  •••*  0 )jK  with 

Pq  <  Pq  for  some  point  Pe/nA.  Applying  (A. 2. 17)  as  above 
*  «  %  «  «  •  %  «* 

we  obtain  if,  a  such  that  wP+  a  >0,  wP+  <*  <0.  *Q  can  be 

•  ^ 

taken  as  1  for  the  same  reasons  as  above  and  w  »  ir  ,  y  -  a 
is  a  feasible  solution  to  (1.3)  contradicting  the  assump¬ 
tion  that  no  such  solutions  exist. 

(c)  Suppose  y-Inf  {y  |  there  exists  «  such  that  w,y  satisfies 
(1.3))  la  finite  and  let  P -{-y,0,  ...,  0}.  Let 

A  »  {P  -  (Pq,  .  .  .  ,  Pffl)  /  P^  -  P^’  ,  1  -  1,  .  .  .  ,  m,  Pq  >  Pq' 

for  P'cfO.  Ala  closed  and  PrfA  since  (1.1)  is  infeasible. 

Hence  there  exists  w  such  that  *P  <  Inf{*P/PeA}  (A. 2. 17). 

We  have,  by  the  definition  of  A,  that  Wq  >  0  (assuming  A 

non- trivial,  if  A  is  empty  the  result  is  obvious).  If 

»Q  >  o,  (*,y)  satisfying  ir--i*ir*y>y>  Inf{iP/PeA>  i8  a 

*0 

solution  of  (1.3)  with  y  <  y  contradicting  the  definition  of  y. 
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Now  suppose  ■  0  .  We  then  have  *P  ■  0  hence  *  P  >  6  > 
all  P  cK  .  Let  be  any  feasible  solution  of  (1.3). 
we  have  (*  +  Xir)P  -  -y  +  X6.  Thus  (*  +  x»),  jj  -  x«  Is  dual 
feasible.  But  p  -  X6  can  be  made  arbitrarily  negative 
increasing  x./// 


0  for 
Then 

by 
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3.  The  simplex  method  and  the  fundamental  problem 

The  algorithm  we  present  here  is  basically  a  speciali¬ 
zation  of  the  generalized  programming  procedure  outlined 
in  Chapter  I.  On  the  other  hand  we  will,  in  fact,  use  the 
results  of  this  section  to  give  a  convergence  proof  for  the 
generalized  programming  procedure.  We  consider  the  fun¬ 
damental  problem  in  the  equivalent  form  (3.1)  where  for  the 
time  being  we  take  d^  *  1. 

(3.1)  Max  Z 


Xjio 

P^cJCcEm+1  and  may  be  chosen 

arbitrarily  from  y. 

Where  we  assume  that  .V  is  closed,  bounded  below,  and 
that  the  problem  is  regular,  we  also  assume,  initially  that 
y  has  interior  points. 

Let  A  be  the  characteristic  cone  for  x(A.2.15),  and 
let  y1  and  yt1  be  sets  with  the  following  properties: 

y  *A<§  (P/P  -  *  0,  £  -  i.pW) 

A  -  (P/P  -  *  °>  P^1} 

Further,  assume  without  loss  of  generality  that.  A1  is  compact. 
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If  we  can  take  X  to  be  compact,  we  say  the  X  is  com- 


?act  enough. 


We  initiate  the  algorithm  by  assuming  we  have  m  +  1 
fixed  points  P1, . . . , Pm+1eX  such  that  (UQ, P1, . . . ,  Pm+1)  is 
a  non-degenerate  basis  for  (3.1)  where  we  take 
di  =  d2  =  ...  =  dm+1  =  1.  A  basis  is  non-degen  erate  if 
every  basic  variable  is  strictly  positive  in  the  correspond¬ 
ing  basic  solution.  Such  a  basi3  must  exist  by  the  regu¬ 
larity  hypothesis  and  the  assumption  of  interior  points 
for  X*  At  the  end  of  this  section  we  show  how  to  obtain 
this  initial  basis  so  that  P1,...,Pm+1  are  also  extreme 
points  and  also  what  to  do  if  X  has  no  interior. 

The  algorithm  goes  as  follows.  At  iteration  k  we  have 
n  =  m+k  fixed  points  P^ .  (3.1)  is  solved,  for  the  fixed 

choices  of  P^,  by  the  simplex  method  obtaining  a  point 
m+k 

(3. 2. a)  Qk  .  -  (-zk,0,...,0) 

J-l 

k  1c 

as  an  optimal  solution,  and  dual  variables  (ir  ,  p.  )  satisfying 
( 3 . 2 .  b )  Min  pk 

s.t.  TrkPJ  +  d^p.k  ^  0  J  «  1,  ...,m+k 
Aq  =  V*  =  1. 

(3.1)  is  called  the  master  problem  and  (3*1)  with 
m  +k  particular  choices  of  P^  the  restricted  master.  We 
then  try  to  solve  the  subproblem 
(3.3)  Min  mkP  +  nk 


If  Min  (ir~P  +  u"-)  =  0,  then  Q  given  by  (3. 2. a)  is  an 

optimal  solution  for  the  problem  (3.1)  with  arbitrary  choices 

for  the  P^'s  rather  than  Just  the  particular  ones  in  the 

restricted  master.  Since  the  minimum  in  (3.3)  may  not  exist, 

we  have  to  be  a  little  careful  how  we  perform  the  "pricing 

out" .  First  we  observe  that  if  ir^Q  <  0  for  QeX,  and  PeX 

that  ir^P  +  AQ)  goes  to  -®  as  A  increases  and  P  +  Aq  c  x 

for  all  A  ^  0.  The  next  observation  we  make  is  that 

Inf  irQ  <  0  if  and  only  if  Inf  irQ  <  0  .  Clearly 
QU1 

Inf  TrtJ  «  Inf  t rQ.  Suppose  Q  «  where  Q  Is  an 

Qei1  QeX  J-l 

arbitrary  member  ofX  and  >  0  Q^eX1,  J  «  l,...,n. 

Then  wQ .  T  a.irQ^  and  clearly  min  <0  if  irQ  <  0. 

J  J 

Hence  there  Is  a  such  that  <  0.  So  the  first 

step  is  to  solve  the  alternate  subproblera 

(3.4)  Min  v^P.  Since  X1  is  compact  the  minimum  is  achieved 
PeX1 

if  the  minimum  is  less  than  zero,  we  augment  (3V1)  by  an 
additional  column  pm+k+^  with  dm | ^  [  ^  -  0,  where  Pm+k+1 
is  a  minimizing  vector  solution  to  3.4.  Such  a  column  is 
called  a  homogeneous  column. 

We  then  consider  (3.3),  but  instead  of  minimizing  over  X, 
we  minimize  over  X^fiS(k)  where  S(k)  is  the  sphere  of 
radius  k  centered  at  the  origin.  We  use  X*,  because  by 
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the  same  reasoning  as  for  j} ,  Inf  (ttP /  Pex)  =  Inf  (uP/pex1). 
We  bound  the  set  using  S(k)  since  the  infimum  may  not  be 
achieved  at  a  finite  point.  If  X  is  compact  enough,  i.e., 

X1  is  compact,  we  need  not  use  the  S(k).  As  an  example, 
all  polyhedral  sets  are  compact  enough  (A. 2.7).  If 
Min  (irkP  +  /  P€^'nS(k)}  <  0  we  introduce  a  minimizing 

vector  pm+^+l  with  =  1.  If  the  minimum  is  0,  we 

increase  the  radius  of  the  sphere  by  one,  and  try  again. 

If  X  is  not  compact  enough,  it  is  necessary  to  be  able  to 
tell  whether  Min  (tt  P  +  p.  /  Pex  }  is  equal  to  zero  or  not, 
in  order  to  be  able  to  tell  when  to  terminate  the  algorithm. 

To  illustrate  the  situations  which  may  arise,  we  con¬ 
sider  the  following  examples.  In  each  case  ir  =  (tTq, tt^ )  «  (1,0). 
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In  this  example  A  -  ((x^x^)  / x^  ■  -X,x^  ■  0,  X  £  0}, 
thus  It  A^  ■  ((-1,0)}  we  would  introduce  (-1,0)  as  the 
next  (homogeneous)  column  with  dm+Jc+^  ■  0.  In  the  second 
example,  A  *  ((0,X)/X  ^  0}  so  Min  ( irQ / QcX)  «  0. 


Thus  we  truncate  at  a  distance  k  from  the  origin  and  intro¬ 
duce  P"1-**1  with  dm*+1  -  1. 

We  note  that  if  X  is  a  cone,  then  -  A  ®  (P)  where  P 
is  the  vertex  of  the  cone  and  at  every  iteration  we  consider 
only  (3.4).  If  on  the  other  hand,  x  Is  compact,  we  need 
consider  only  (3.3)  without  truncation. 

We  will  show  in  Theorem  3.7  that  this  process  converges 
on  a  subsequence  to  optimal  solutions  of  both  (1.1)  and  1.3) 
whenever  X  is  bounded  below  and  regular. 

3.5  Lemma:  If  X  is  bounded  below  and  regular,  there  is  an 
accumulation  point  (tt,ii)  of  the  sequence  (tt^,^). 
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Proof:  Referring  to  (3. 2. a)  and  (3.2.b)  we  note  that 
-4*  £  -uk+1  ^  Qq#,  for  k  ■  1,2,...,  where  Q#  ia  the  optimal 

9 

solution  of  (1.1).  Hence  there  is  a  limit  point  p.  of  the 

Jr 

sequence  p.  • 

Consider  the  plane,  H,  defined  by  ir*P  +  u1  -  0.  By 

the  non-degene: acy  hypothesis,  Q1  given  by  (3.2.b),  see 

Fig.  3.6,  is  a  relative  interior  point  of  the  m-dlmensional 

simplex  in  H  determined  by  P1,  ...,Pm+*.  Hence  the  ray 

Q  +  MQ  -  Q  )  for  X  >  0  is  an  interior  ray  of  the  cone, 

C  ,  eminating  from  Q  which  is  "generated''  by  linear  comblna- 

tions  of  the  vectors  PJ  -  Q  ,  J  *  1,  ...,m+l.  Hence  there 

is  a  "circular”  cone,  c2,  of  the  form  c“  -  (P  +  Q#  / 

2  2  p  <  2  2 

P1  +  p2  +  ***  +  pm  ^  €  po  )  Por  80me  e  >  °»  contained 

1  k  ic 

in  c  .  Any  dual  variables  it  ,p.  must  satisfy 

ir^P  +  p.k  ^  0  for  any  P^,  J  -  1,  ...,m+k  and  thus 

any  P  of  the  form 
m+1 

P  -  Y  A jPJ,  Aj  i  0  ^Aj  .1.  Since  nk 

^P - 0  for  these  P.  But  this  hyperplane  goes  through 
Q*  and  hence  ir^P  -  £  0  for  all  Pec2.  If  we  let 

P  »  (l,+t,0,0, . .  .,0)  +  Q  which  belongs  to  C  ,  then 
-  *k(l,e,0,...,0)  +  fV  - 

.k  _ _ k  i  k  — 

m  TTq  £  €TT^  ■  1  +  €1T^  >  ® 

which  implies  that  |  ,  applying  the  same  argument 
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to  1 7T |  •••i|irmk|  we  show  that  each  component  of  7rk  Is 
bounded  in  absolute  value,  hence  there  is  a  convergent 
subsequence  of  Trk./// 

The  situation  is  depicted  schematically  in  Fig.  3.6. 
3.7  Theorem:  There  exists  a  convergent  subsequence  of 
the  such  that  the  limit  v,[i  and  (-m-,0,  . . .  ,0)  satisfy 

(1.3)  and  (1.1)  respectively. 


By  a  common  abuse  of  notation,  we  denote  this  sub¬ 
sequence  also  by 

Proof:  By  lemma  3.5  there  is  a  convergent  subsequence  with 

limit  it, (j. .  We  will  show  that  (tt, m- )  satisfies  (1.3)  and 

hence  by  the  duality  theorem  2.9  (-p.,0, . . .  ,0)  =  Q  satisfies 
.  .  k  *  k 

(1.1)  and  Q  Q  ,  where  Q  is  the  optimal  solution  for 
the  problem  (3.1)  at  the  k  iteration.  Let  us  consider 
the  subsequence  which  corresponds  to  the  iterations  which 
result  in  homogeneous  columns  being  introduced  for  the 
next  iteration.  We  assert  that 

irP  >  0  for  all  PeA  or  equivalently  A1.  Suppose 

*  *  1  k 

not,  then  ttP  <  -6  for  some  P  €A  ,  b  >  0.  Since  ir  -*■  ir, 

Min  irkP  =  7rkPm+k+1  <  _6/2,  for  all  k  >  kn 
Pc/  0 

for  some  k^.  On  the  other  hand,  ir^Pm+k+1  £  o  for  all 

I  >  k  since  ir*  is  a  dual  solution  for  the  linear  program- 

ming  problem  at  the  &  iteration.  Subtracting,  we  obtain 

(w‘  -  ,rk)Pm+k+1  > 


R.  /O 
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Letting  t  and  k  go  to  Infinity  (such  that  i>k)  on  the 
convergent  subsequence,  we  have  a  contradiction  since 
lira  (w*  -  tk)  -  0. 
k  <  i 

k  • 

Similarly  we  shot,  that 

irP+  p  >0  for  all  PcK1.  Suppose  the  contrary  were 

#1  • 

true.  Then  there  exists  P  tK  such  that  »P  +  u  <  -«  <  0. 

Then  for  some  k^ 

rkPflM’K+l  +  pk  .  Min  nkP+yk 

Pc*} 

AeA1 

<  -^2,  for  k  >  kQ. 

Subtracting  and  passing  to  the  limit  in  the  same  way  as 
before ,  we  obtain  a  contradiction.  Thus  since  any  column 
of  K  can  be  written  as  a  positive  sum  of  elements  in  A* 
plus  a  convex  sum  of  elements  of  K1  we  have  *P+  u  >0 
for  all  P  c  <./// 

We  now  turn  to  the  problem  of  getting  a  starting 
solution.  The  idea  is  to  adjoin  to  K  the  artificial  points 
^(M)  .  (M.P^,  ....  pj)  J  .  1,  m+1  where  (P^,  ....  P;J) 

J  -  1,  ...,  m+1  form  a  simplex  in  E®  containing  0D  as  an 
interior  point.  We  denote  by  K(M)  the  convex  hull  of  K 
and  the  points  P^(M).  We  then  use  these  points  as  the 
starting  basis.  If  M  is  taken  large  enough  it  can  easily 
be  seen  that  the  algorithm  will  converge  to  an  optimal  feasible 


54 
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solution,  and  aft  r  a  finite  number  of  steps  the  artificial 
points  will  no  longer  be  in  the  basis. 

If  we  call  the  above  problem  the  rectified  problem, 
we  have  then  the  following  theorem: 

3.8  Theorem:  For  any  problem  (1.1)  which  is  regular  and 
such  that  X  has  a  non-empty  interior,  there  exists  an  MQ 
such  that  for  all  M  >  MQ  the  generalized  simplex  algorithm 
applied  to  the  rectified  problem  converges  to  an  optimal 
solution  of  the  original  problem  if  one  exists,  and  after 
a  finite  number  of  iterations  the  artificial  variables  can 
be  discarded. 

Proof:  We  will  not  give  all  the  details  but  will  outline 
the  essential  steps.  Since  X  is  regular,  bounded  below  and 
has  a  non-empty  interior,  there  are  linearly  Independent 
points  T1, . . . ,Tm+1  and  a  point  Q1  belonging  to  the  interior 
of  X(U  which  is  a  strictly  positive  linear  combination  of 
T1, . . .,Pm+1.  We  can  then  go  through  the  same  construction 
as  in  the  proof  of  lemma  3.5  to  obtain  a  right  circular 

cone  C2(M)  =  (P  +  Q* (Mj/P-^  +  ...  +  Pm2  £  €2(M)Pq2},  for 

# .  . 

each  M,  where  Q  (M)  is  the  optimal  solution  for  the  rectified 
problem.  Clearly  if  <  Mg,  we  have 

Qo’fMj)  £  Q0*(M2)  S  %*,  and 

a(Ma)  £  £(M2)  £  e(=  £(»)). 

Let  M,  be  any  real  number,  let  E  =  max  {7n*} 

1  J  0 
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^  «  max  [e,  Qq#  (Mx)  +  .  Then  referring  to 

Pig.  3.9  we  see  that  ^(Mq)  belongs  to 

C2(M1)  -  (P  +  Q#(M1)/P12  +  ...  +  Pm2  ^  €2(M1)P0J 

^  4  O 

J  «  1,  ...,m  +  l,  and  as  a  matter  of  fact,  PJ  (Mq)€  C  (M)  fcr  all 
including  (passing  to  the  limit)  M  ■  ».  Clearly 
any  dual  variables  tt,m.  which  are  an  optimal  solution  to 


the  dual  of  the  rectified  problem  must  satisfy  ttP  +  p.  ^  0 
for  all  Pcc(®)n  X(M).  But  for  M  >  Mq,  P^(M)  belongs  to 
the  interior  of  C2(»)  and  hence  ir?^  +  p.  >  0  (2.8) 

J  ■  l,...,m  +  l.  Hence,  by  the  convergence  of  TT^,(ik  to 


some  such  7 r,n.j  after  a  finite  number  of  steps  k^,  H 

If 

M-  „•  0  J  -  1,  ...,m+l,  k  >  k^.  But  for  any  basic  variable 
P  for  the  kth  iteration,  we  must  have  TrkP  +  »  0./// 


The  argument  is  depicted  schematically  in  Fig.  3.9. 
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This  "phase  I"  procedure  however  depends  on  the  fact 
that  MQ  is  known  in  some  a  priori  manner.  In  case  X  is 
not  full  dimensional,  we  simply  introduce  the  m  +  1  linearly 

<vl  <\,mxl 

independent  points  P  as  before  and  if  M  is  large 

enough  the  process  will  converge  to  an  optimal  solution  to 

the  original  problem.  However,  in  this  case,  a  number  of 
~  1 

PJ,s  equal  to  the  deficiency  of  X  will  remain  in  a  basis 
although  the  value  of  the  corresponding  variables  will 
converge  to  zero. 

We  now  turn  to  some  applications. 


4.  Applications  to  linear  and  convex  programming,  and  to 
the  decomposition  principle 


The  first  application  we  consider  Is  to  linear  pro¬ 
gramming.  We  consider  the  linear  programming  problem  in 
the  form: 

(4.1)  Max  Z 

s.t.  UqZ  +  Ax  »  b 
x  £  0 

We  let  =  (P  *  Ax  -  b/x  £  0)  which  is  a  polyhedral  cone 


A  «  (A1,  ...,An},  and  moreover  take  P*,...,?®  as  homogeneous 

columns  and  do  not  use  the  ^  Aj  ■  1  equation  at  all  since 

we  are  dealing  with  a  cone.  Since  we  have  one  less  equation 
we  need  one  less  point  in  our  starting  solution.  Furthermore 


:  Xj 

have  to  worry  about  regularity  because  of  the  "pointedness" 
property  (A. 2. 13)  of  polyhedral  sets.  In  passing,  we  may 
note  that  the  regularity  assumption  for  linear  programs  of 
full  rank  is  equivalent  to  assuming  the  existence  of  a  non¬ 
degenerate  basic  feasible  solution  [L.P.E.,  p.8l]  .  With  these 
conventions  the  generalized  simplex  method  becomes  the 
ordinary  simplex  method  arid  the  duality  theorem  2.9  becomes 
the  corresponding  one  for  linear  programming,  Theorem  1.1.5. 

To  see  this  we  notice  that  any  supporting  hyperplane  of 
K  determined  by  tt,  y  passes  through  the  vertex  -b  of  K  . 

That  is  n(-b)+  y  -  0,  or  y  -nb  and  at  optimality 
min  y  -  wb  -  max  Z . 

The  choice  of  A1  given  above  is  not  the  only  possible 
choice,  a  common  scaling  device  is  to  take 

1  (  a1  An  \ 

A  •  VT.  T.  «  >  •••>  ~Ti'n i  (  which  Is  Just  the  projection 
(  I  I  AJ‘I  I  llAnll) 

of  the  columns  of  a  onto  the  unit  sphere.  This  can  be 
varied  by  taking  various  choices  for  the  norm. 

A  second  application  is  to  convex  programming,  in 
particular  to  the  convex  programming  algorithm  of  G.B. 

Dantzig  [L.P.E.,  Ch.24]  .  In  fact,  it  was  this  algorithm 
which  motivated  the  approach  used  in  this  chapter  and,  in 
particular,  the  proof  of  the  convergence  of  the  generalized 


since  K 


-{i‘j 


0  A^eA*!  is  polyhedral  we  do  not 
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simplex  method  is  a  rather  straightforward  generalization 
of  the  convergence  proof  given  by  Dantzig  for  his  algorithm. 

The  problem  we  consider  here  is  the  following: 

Find  x  -  (x^,  , xn)c(R  a  compact  enough  convex  set 
in  En  and  Minimum  w  satisfying 

L^ (x)  -  0  i  "  lj2j  •  •  • j  r 
(4.3)  ^(x)  <0  1  -  r+1,  ...,  m 

♦0(x)  -  w  (Min) 

where  ^(x)  -  L^(x)  for  i  -  1,  . ..,  r  are  linear  (inhomogeneous) 
and  ^(x)  for  i  -  0,  and  1  -  r+1,  . ..,  m  are  continuous 
convex  functions  (not  necessarily  differentiable). 

Let  K  -  { (PQ,  ...,  -  ^(x)  -  L±(x)  1-1,  . ..,  r, 

Pi  -  ^(x)  i  •  0,r+l,  . ..,  ra,  x e(R}  .  Clearly  <  is 
convex  and  closed;  but  K  need  not  be  regular  or  bounded 
below.  Dantzig  assumes  that  <R  is  bounded  which  guarantees 
that  K  is  bounded  below  and  makes  the 

4. '4  Non-degeneracy  assumption:  The  homogeneous  parts  of 
the  are  linearly  Independent  and  there  exists  a  point 
x°c( R  such  that  Li(x°)  ■  0,  i  •  1,  ...,  r  and  ♦1(x°)  <  0 
for  i  -  r+1,  . . . ,  a. 

4.5  Theorem :  The  non-degeneracy  assumption  4.4  implies 
that  (4.3)  is  regular. 

Proof;  Since  Pc  K  implies  that  P^,  ...,  Pp  -  0,  K  n  at  most 
m+l-r  dimensional.  Let  P°  -  (^(x0)^,  ...,  0,*p+1(x°), 

...,  4a(x°)).  Then  by  definition  of  K  if  P  is  such  that 
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p0  -  pO’  P1  "  •••  "  Pr  "  °'  Pr+1  m  ‘r+lJ 
then  P  c  K  .  The  set  of  all  such  P  is  obviously  a  cone  of 

dimension  jtj  + 1  -  r  containing  the  point  P1  ■  (♦q(xq)  + 1,0, . . .  ,0) 

as  an  interior  point.  The  following  conditions 

are  equivalent  to  (4.4):  The  homogeneous  parts  of  are 

linearly  independent  and  there  exists  points  x*, 

i  -  r+1,  . ..,  m  in  K  such  that  Lj(xi)-0,  j  *  1,  . ..,  r 

and  4>1(xi)  <  0,  i  -  r+1,  . ..,  m. 

Thus  the  duality  theorem  2.9  applies  to  the  convex 
programming  problem  and  the  generalized  simplex  method 
converges  to  an  optimal  solution. 

The  application  of  our  procedure  to  the  decomposition 
principle  [L.P.E.,  Ch.23 J  is  straight  forward.  A  decomp- 
sition  problem  can  be  formulated 


_0  _0  _0 
>  P-.i»  •••>  Pm  >  Pm 


Max  Z 

(4.6)  s.t.  UQZ  +  axx  -  bx 

A?x  -  bg 
x  >  0 

where  we  assume  Ag  is  of  a  very  special  structure,  e.g., 
of  transportation  structure  or  a  matrix  which  can  be  de¬ 
composed  so  that  if  AgX  -  bg  were  the  only  constraints 
Involved,  the  solution  would  be  quite  easy.  We  can  also 
apply  the  theory  if  {x  j  AgX  -  bg,  x  >  0  }  is  replaced  by 
an  arbitrary  convex  set.  The  constraint  UqZ  +  A^x  ■  b^ 
on  the  other  hand  is  arbitrary. 

In  this  case 

JC  -  {  Axx  -  b-j/AgX  *  bg ,  x  >  0 } 

K1  -  (AjX  -  b^/x  is  an  extreme  point  for  the  constraint 


f 
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set  AgX  =  bg,  x  £  o}#  and  A  »  (A^  /  where  Agx  -0,  x  >  0>  . 
With  these  definitions,  the  subproblem  (3,3)  becomes 
Min  irk  (Axx  -  bx)  +  \ik 

(*-7) 

s.t.  AgX  «  bg 
x  ^  0 

which  Is  by  hypothesis  easy  to  solve.  If  (4.7)  Is  unbounded 
below  a  homogeneous  column  from  A.  Is  found  whlcn  Is  Intro¬ 
duced.  Thus  the  generalized  simplex  method  applied  to 
(4.6)  Is  simply  the  decomposition  algorithm  of  Dantzig  and 
Wolfe.  For  the  linear  decomposition  problem  we  need  not 
Introduce  a  point  of  K1  at  each  iteration.  On  the  other 
hand,  if  {x  lAgX-bg,  xj>0}  is  replaced  by  a  nonpolyhedral 
convex  set  we  must  introduce  points  of  K1  sufficiently 
often  to  guarantee  convergence  to  an  optimal  solution. 


I 
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5.  Generalized  Programming: 

As  a  generalization  of  (l.l)  consider 
(5.1)  Max  z 

s.t.  U02+£V.O 


where  each  la  a  closed,  convex  set  in  Era+1.  (5.1)  has 
the  natural  dual 
n_ 

(5.2)  Min 


s.t.  7TPJ  + 


1  > . . . , n 


V 


-  1 


For  these  programs  we  have 

5.3  Theorem:  If_X  -  (T)  ■  (F/p  -YV.pW) 

1b_  closed,  regular  and  bounded  below  then  (5.1)  and  (5. 
both  have  optimal  solutions,  and  Min  ^  »  Max  Z. 

ft^oof:  Apply  the  duality  theorem  2.9  to 
(5.4)  Max  z 


a.t.  UqX  +  p  «  0 
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*  #  * 

This  gives  us  *  t  y  t  and  P  which  are  optimal  solu- 

* 

tions  to  the  dual  and  primal  respectively.  P  is  clearly 

£ 

an  optimal  solution  to  (3.1)*  Since  P  cK  there  exists  a 
representation  in  terms  of  the  .  Denote  one  such  by 


■*  ■  r  p*j 


*  *  #  #  « 

where  *  p  +  y-0,  irp  +  y>o  for 

all  P  c  K 


Let  -  -„VJ, 

*  1 

then  ir  PJ  +  u.  > 


then  *  PJ  +  y.  >0,  for  all  P^  e  for  each  J  -  1,  n. 

o  *  Jq 

If  not  there  would  exist  some  P  such  that  w  P  +y.  <-e< 

J  J°" 

Let  P  -  l  P*J  +  P  0  then 

jyj0 

*  *  v  *_Jq  * 

irP+W  -  >  irpJ+trPu  +  y 


V  *  *1  4 

<  £  ir  P  -  e  +y 

J-l 


-  -  c  <  0  contradiction. 

*  * 

Hence  ir  ,yj  J  ■  1,  n  is  feasible  in  (5.2).  Conversely, 

let  J  •  1,  . ..,  n  be  any  feasible  solution  to  (5.2) 

and  let  y  ■  £  yj  then  ir,y  is  feasible  in  (5.4)  and  hence 

*  # 

y  >  y  .  For  y j  -  y j  equality  holds./// 

From  the  proof  of  the  previous  theorem  we  have 


immediately : 
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*  v  * 

5.5  Corollary:  For  every  P  ,  tt  ,  p.  which  are  optimal 

solutions  to  the  (5.4)  and  Its  dual  respectively,  there 

#  #  #  #  1 

corresponds  optimal  solutions  P,7r,p.j-7rP,; 

J  m  1,  ...,n  of  (5.1)  and  (5.2)  respectively  where 

*  *  *  #  <  *  * 
P__«_P1  +  ...  +  Pn  .  Conversely,  If  P  w  ,  p.  ^ 

J  m  1,  ...,n  are  optimal  solutions  for  (5.1)  and  (5.2) 

« 

M-j  are  optimal 


respectively,  then  P 


solutions  to  (5.4)  and  Its  dual  respectively. 

Remark :  It  may  turn  out  that  the  direct  sum  of  a  finite 
number  of  closed  convex  sets  Is  no  longer  closed  and  In  this 
case  the  correspondence  In  corollary  5*5  may  not  be  valid. 
If,  however,  at  most  one  of  the  ^  la  not  bounded,  then 
corollary  5«5  holds. 


As  an  example,  consider  the  following  two  sets: 


The  origin  Is  a  limit  point  of  X  ■  X1  ©x2  but  cannot  be 

1  2  1  2  2 
expressed  as  the  sum  of  P  +  P  ,  P  ex,  P  eX  • 
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Geometrically  we  can  illustrate  theorem  5.5  in  the 

two  dimensional  case  in  the  following  way: 

1  2 

Theorem:  Let  X  and  X  be  two  closed  convex  subsets  of  the 
plane  with  X1  bounded;  and  let  X  «■  X*  ©  X2  be  their  direct 

sum.  Let  £  be  any  directed  line  intersecting  the  relative 

* 

interior  of  X  and  let  P  be  the  minimal  point  on  Xn£ 


relative  to  £.  Then  there  exist  parallel  supporting  lines 

HjH1,!!2  to  X  at  P*,  to  X1  at  P*1  and  to  X2  at  P*2  respec- 

#  #1  *p  1#  2# 

tively  such  that  P  »  P  +  P  and  P  and  P  are  minimal 

112  2  12 
points  on  X  n  £  ,  X  respectively  where  £  and  £  are 

parallel  and  in  the  same  direction  as  £. 


Apply  the  identification  in  Corollary  5*5>  to  the  problem 
for  X,  X1,  X2./// 
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The  algorithm  we  use  for  solving  (5*1)  is  the  obvious 
one.  We  assume  we  have  P^'1  i  .  1,  ...,m-ni+l  which  together 
with  UQ  forms  a  basis  for 
(5.6)  Max  Z 


s.t . 


u0z  + 


Lu  L-i 

J-l  i-1 


0  . 


? 


1, . .  .,n 


PJ,ieX'J,  dj  1  «  0  or  1. 

t* 

For  the  k  iteration  we  solve  (5.6)  as  a  linear  program 
obtaining  dual  variables  7rk,p.kj  J  -  1,  . ..,n.  We  then 
solve  the  n-subproblems 


(5.7) 


Aj  -  min  irkP^  + 
s.t.  P^eX^ 


and  let  A  *  min  A..  If  A  ^  0,  we  are  done  using  Theorem 

J  J 

5.3.  If  not,  suppose 

0  >  A  »  A.  «  ti^P  0  +  M-k«  ,  then  P  0  is 
Jn  J  r\ 


introduced  into  (5.6)  with  the  corresponding  d  »  1.  If 
the  inflmura  is  not  attained,  we  make  the  same  modifications 
as  In  Section  3  either  truncating  X  or  introducing  a  homo¬ 
geneous  column. 

We  now  indicate  a  convergence  proof  for  this  algorithm. 


The  fact  that  X 


is  bounded  below  implies  that 
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1  k 

each  X  is  bounded  below.  Hence  p  >  is  bounded  below.  On 

V  k  k 

the  other  hand  )  p  ^  -  p  is  monotonic  non-increasing, 

ic 

so  clearly  for  some  Jq,  Pj  is  bounded  above  and  converges 

on  a  subsequence.  Excluding  p  from  the  sum,  the  same 

J0 

argument  may  be  repeated  for  the  rest  of  the  components. 

k 

Hence  there  is  a  subsequence  of  the  p  j  converging  to  p^. 
There  is  a  convergent  subsequence  of  the  tt^  converging  to 
some  ir  by  the  reasoning  as  in  Lemma  3.5.  Then  applying 
the  same  sort  of  argument  used  in  Theorem  3.7*  we  can  show 
that  ir,Pj  form  an  optimal  solution  for  the  dual  problem 


(5.2).  If  we  let  Q* 


?a':’ 


be  the  optimal  solution  at 


th  J  k  * 

the  k  iteration,  we  have  Q  -*•  P  the  optimal  solution 

of  5.^.  Further,  we  know  that  irQk  +  ^  Pj  irP*  +  ^  p^  -  0 

1  k 

and  irQ*"  +  pj  -♦  0.  This  does  not  imply  however  that  the 
1  k 

Q*"  have  convergent  subsequences.  If  we  have  that,  each 

1  Ik 

Jr  is  compact  except  for  possibly  one,  then  the  do 

converge  on  a  subsequence. 

Now  we  consider  the  generalized  linear  program  of 

Wolfe  and  Dantzig  which  is 


(5.8) 


Max  Z 


a.t.  JJqZ  + 


Ix)*3 


- 0 


s <  J  •  1, where  fl/j  la 
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a  closed  convex  subset  of  E  . 

If  we  make  the  identification 

XJ  «(PQJ/  QJe^,  p  ^  0}  J  =  1, . . .,n 
and  *  (-Q0),  (5*8)  is  equivalent  to 


(5-9 


s.t.  v+  I  xjpJ  -  e°  -  o 


“  0  J  =  0, . . . ,n 


J  —  1  f  •  •  •  j  n 


where  the  restraints  Aj  =  1 ;  J  n  are  eliminated 

since  y? t  J  =  1, ...,n  are  cones.  The  formal  dual  of  (5.8) 
is, by  applying  (5.2)  to  (5*9)  and  making  the  proper  identi¬ 


fications  ! 
(5.10)  I 


Min  p.  »  ttQ 
s.t.  ^  0 


QJ€0f, 


J  =  1) . . . 


7rtJ0  =  7T0  =  ! 


We  then  prove  the  following: 

5.11  Theorem:  If  there  exists  a  tt,  such  that  ttq  a  1, 

=  0  for  all  . _ J  =  1,  . . .  ,n  and  if  the 

J 

cone  generated  by  07  is  closed  and  the  line  0  + 


w 

Intersects  its  relative  interior  then  both  (5*8)  and  (5*10 


have  optimal  solutions  and  Min  p.  =  Max  Z. 

Proof:  We  simply  show  that  the  hypotheses  imply  2.9  for 

the  problem (5. 9*  v 

P  ^  0. 

ttQj  £  0  trpPJ  =  0,  f\I£  -ttQ  =  0  set  |i  >  o  otherwise  set 


M-  *  irQ  then 
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x  =  xJ  ©  K° 


is  bounded  below. 


X  is  obviously 


closed  and  regular./// 

5.12  Corollary:  If  07,  is  compact  J  «  l,...,n  and  if  the 

“  —  —  J  — ■  ■  ■  ■  ■  ■  ■  — -  ■  ■  ■  — 

0 

line  Q  +  tU^,  t  real,  intersects  the  relative  interior  of 

the  cone  generated  by  then  both  (5.8)  and  (5.10)  have 

optimal  solutions  and  Min  p.  *  Max  Z. 

- -  -  —  1 

Proof:  Cones  generated  by  bounded  closed  sets  are  closed 
(A. 2. 14),  and  the  direct  sura  of  a  finite  number  of  closed 
cones  with  common  vertex  is  closed.  To  see  this  we  consider 
without  loss  of  generality,  the  case  for  n  *  2.  Suppose 
+  z  where  x^ex^  y^ex2.  Since  the  unit  sphere  Is 

compact,  there  exist  x,  y  and  a  subsequence  of  the  Integer 

Jl  such  that  xJl  |  |xJl||  ||  *  ^  ,  yJi-  |  |yJl  1 1  11^  || 

J<  J4 

and  x  +  y  -*  z,  i  -  1,2, ....  Since  are  closed 

xcX-p  yex2,  and  the  closed  cone  spanned  by  x  and  y  is 
in  X1  +  X2  -  fp^1  +  p2P2  /  prp2  *  0,  P1cfl71,P2€0r2)./// 


The  generalized  simplex  method  in  this  case  becomes 
the  generalized  programming  technique  of  Wolfe  and  Dantzig 
[L.P.E.,  Ch.22]  ard  converges. 

In  the  next  chapter  we  discuss  in  detail  the  application 
of  the  generalized  simplex  method  to  optimal  control  theory. 
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Ch.  Ill  The  Theory  of  Optimal  Control  for  Mathematical 
Programmers 

0.  Introduction:  We  can  think  of  a  system  of  ordinary 
differential  equations 
(0.1) 

=  f(x(t),u(t),t)  x(t)  =  (x0(t),...,xn(t))cEn+1 

uefl  u(t)eEr 

x(0)€X°  te[0,oo) 

as  a  dynamical  system  which  allows  us  to  travel  from  a 
point  in  3(Pat  time  t  =  0  to  various  otner  points  at  later 
times.  To  reach  these  points  we  may  make  choices  for  the 
Initial  point  x(o)  among  those  of  X°and  may  choose  various 
"control  functions"  u  among  those  in  ft.  For  certain  special 
cases  cf  (0.1)  the  set  of  points,  S^X*"5),  which  can  be  reached  at  time  T  is 
convex.  If  so,  then  consider  an  optimization  problem  such 
as  minimizing  Xq(T)  for  some  fixed  T  over  points  belonging 

to  ST  and  the  line  £  =  {(xQ,...,xn)  /  x1  =  ...  *  xfi  =0} 

In  this  case 

# 

A  special  case  is  minimizing 

J0Tf0(x1(t),  . .  .,xn(t),u(t),t)dt  subject  to 

dx.(t) 

=  f  j  (x^ (t ),..., x^ ( t ), u,  t ) ,  Xj (T)  =0  J  =  l,...,n. 

dx  (t) 

Simply  define  xQ(t)  by  -  =  fQ(x(t),u(t),t),  xQ(o)  =  0. 
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we  have  a  supporting  hyperplane  to  the  set  of  reachable 

ft 

points  at  time  T  including  (xQ(T),0, . .  .,0),  the  optimizing 
point.  To  find  the  proper  supporting  hyperplane  the  generalized 
simplex  method  can  be  used  very  successfully  especially  if 
the  system  (0.1)  is  linear  in  u  and  x.  Moreover,  since  the 
solutions  of  ordinary  differential  equations  depend  con¬ 
tinuously  on  the  value  of  the  starting  point,  it  is  generally 

ft 

true  that  an  optimal  trajectory  x  (t)  must  be  a  boundary 
point  of  Sfc  for  all  t,  0  <  t  <  T.  Thus  if  we  have  the 
supporting  hyperplane  [n(t),M.(t )j  for  0  <  t  <  T  we  know  that 

ft 

n(t)x  (t)  +M-(t)  *  0.  Such  a  "traveling”  hyperplane  can 

be  obtained  as  a  solution  of  the  adjoint  equation  of  (0.1) 

see  (  b.3  )•  For  linear  problems,  additional  hypotheses 

can  be  Imposed  to  guarantee  that  the  condition 

n(t)x*(t)  +  n(t)  -  0  determines  x#(t)  uniquely  [15;  Ch.3],[ll]. 

Mention  should  be  made  of  the  degree  of  generality 
of  the  control  systems  considered  for  the  various  theorems. 

The  rule  here  has  been  to  consider  each  theorem  for  as 
general  a  system  as  possible  without  confusing  the  basic 
ideas  of  the  proof  involved.  Hence  the  generalized  simplex 
method  is  applied  only  to  a  very  simple  linear  optimal 
control  problem  in  order  not  to  confuse  the  basic  method 
with  technical  details  concerned  with  linearization,  general¬ 
ized  trajectories  and  the  like. 
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In  this  chapter  much  use  is  made  of  papers  in  the 
literature  of  control  theory  [7,  10,  11,  12,  14,  15,  16]. 
Frequently  these  papers  treat  the  so  called  time  optimal 

control  problem.  This  problem  is  the  one  of  reaching  a 

T  n+1  0 

set  X  in  E  starting  at  time  0  from  X  in  minimum  time. 

Most  of  the  theory  carries  over  since  the  essential  character* 

istic  of  both  problems  is  that  the  terminal  point  x(T) 

belongs  to  the  boundary  of  the  set  of  reachable  points. 

However,  the  generalized  simplex  method  does  not  apply 

directly  and  modifications  of  the  method  must  be  introduced. 

For  that  reason  we  will  not  treat  the  minimum  time  problem 

here. 
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1.  The  Structure  of  We  assume  for  all  uefl  that 

f(x,u,t)  satisfies  the  conditions  of  Theorems  B.1.3  and 
B.1.6  for  the  existence  and  uniqueness  of  solutions  through 
every  point  (x,t)  of  En+1  x  [0,T].  More  particularly,  we 
assume  that: 

Al.  ucfi  implies  that  u(t)  is  a  bounded  and  measurable 
function  of  t; 

A2.  f(x,u,t)  is  measurable  in  t  for  each  fixed  x,u 
and  is  continuous  Jointly  in  x,u  for  each  fixed  t.  For  any 
bounded  region,  R,  of  E11  x  f  there  is  an  M  such  that 
|  |f  (x,u,t)  1 1  ^  M  for  almost  all  t  c  [0,T]  and  (x,u)  c  R. 

A3,  f  satisfies  a  Lipshitz  condition  locally  In  x. 

We  now  formally  define  the  set  of  reachable  points  at 


time  T,  ST(X°)  by 

(l.l)  ST(X°)  -  (xT/ there  exists  an  absolutely  continuous 

x(t),  0  *  t  £  T,  such  that  x(0)  «  X®,  ^  ■  f(x,u,t)  a.e. 

T 

In  t  for  some  ucft  and  x(T)  -  x  J . 

The  assumptions  Al,  A2,  and  A3  Imply  only  the  existence 
of  local  solutions.  If  f  changes  too  rapidly.  It  may  not 
be  possible  to  define  a  solution  on  the  entire  time  Interval 
[0,T]  passing  through  an  arbitrary  point  of  B"  x  [0,T] . 

To  get  around  this  difficulty  and  at  the  same  time  guarantee 
that  ST ( X° )  is  bounded  for  bounded  jP  we  add  some  additional 
assumptions.  First,  we  prove  the  following  lemma. 

1.2  Lemma:  Let  w(t)  be  an  absolutely  continuous  scalar 
function  such  that  whenever  w(t)  >  0.  we  have  *3?^  >  0 
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for  all  t  where  the  derivative  exists.  Then  If  w(0)  >  0, 
we  have  that  w(t)  >  0  for  all  t  e  [Q,T]. 

Proof;  w(t)  =  w(0)  +  Jq  g(x)dx  where  g(x)  is  a  measurable 
function  of  t  and  §£  «  s(t)  a.e .  [9,  p.207].  Since  w(0)  >  0, 
by  the  continuity  of  w(t),  w(t)  >  0  on  3ome  non -degenerate 
interval  [0,t  ],  Let  t  =  sup  (t/w(x)  >  jw(0)  for  te[0,t]}. 

*  *  r»  t* 

If  t  <  T,  we  have  w(t  )  =  w(0)  +  g(x)dx  >  w(0).  This 

follows  because  w(t)  >  £w(0)  >  0  implies  that  g(x)  >  0  a.e. 

#  * 

on  [0,t  ].  This  is  a  contradiction.  Hence  t  =  T./// 

We  now  make  some  additional  assumptions. 

A4.  Suppose  there  exists  V(x,t)  ;  En+1  x  [0,T]  -*■  R  which 
is  positive  for  all  t  and  x  *  0,  and  V(0,t)  >  0.  Suppose 
V  has  continuous  first  partial  derivatives  in  x  and  t  in 
En  1  x [0,T].  Furthermore,  suppose  that  | |  x  | |  — ►  »  implies 
that  V(x,t)  -*  +«  uniformly  on  [0,T].  Assume  further  that 
there  exists  G(V, t)  which  is  a  strictly  increasing  function 
of  V  for  each  t  such  that  ~  V  (x(t),t)  <  G(V,t)  for  all  solu¬ 
tions,  x(t),  of  (0.1).  Finally,  we  assume  chat  —  v(t)  = 
G(v(t),t)  has  a  solution  v  (t,v^)  on  the  interval  [0,T]  for 

* ,  .  n 

every  non-negative  Initial  condition  v  (0)  =  v  ^.0. 

1.3  Example :  An  easily  applied  special  case  occurs  [7] 
when  we  have 

(1.4)  x  •  f(x,u,t)  £  C  (||  x  | |2  +  1)  for  all  x,t,  and 
ucfi.  Take  V(x,t)  =  (||x(t)  |  |2  +  l)  and  0(V,t)  =  2CV 
Then  ^  V(x(t),t)  =  2  | |  x(t)  | |  ^  | |  x(t)  | | 
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2  ||  *(t)  ||  £  (  ^Xi2  (t))1/2 

,  II  ii  Evi 


2  ||  x(t)  | 


-  2x  •  ^  x  -  2x  •  f  ^  2C  ( | |  x(t)  | | 2  +  1)  =  2CV 

and  ^  =  2Cv  has  the  solution  v(t)  *  v(0)e^^^. 

1.5  Theorem:  If  A1  -  A4  are  satisfied  and  Is  compact, 

then  Sm(X°)  Is  bounded  and  for  every  uefi  and  x°  €  En+1, 


H  v 

there  exists  a  unique  solution  of  ^  »  f(x,u,t)  a.e.  satisfy¬ 
ing  x(0)  =  x°. 

Proof:  The  assumptions  A1  -  A3  guarantee  that  any  local 
solution  can  be  extended  In  a  unique  way  to  the  boundary 
of  any  bounded  region  in  En+1  x  [0,T]  (B .1.4).  Thus  either 
a  local  solution  x(t)  Is  defined  on  [0,T]  as  asserted  or 
there  exists  t_  such  that  11m  | |  x(t)  ( |  «.  Let 


there  exists  t_  such  that  11m  | |  x(t)  | | 

0  ^0 
v  *  sup  V(x,0).  For  any  solution  x(t)  let 


W(t)  *  v  (t;  v°  +  1)  -  V(x(t),  t)  for  the t  for  which  x(t) 
is  defined. 

Then  ^  W  >  G(v*,  t)  -  G(V, t).  By  the  monotonicity  of 

G(V,  t)  with  respect  to  V,  0(v#,  t)  -  0(V,  t)  ^  0  if  v*  ^  V; 

but  W(o)  2.  1  hence  lemma  1.2  applies  and  W(t)  £  0  for  all 

t  for  which  x(t)  is  defined.  Thus  v*(t;v°  +  l)  is  an  upper 

bound  for  V(x(t),  t)  for  these  same  t.  But  if 

lira  |  |  x(t)  ||  -  «  then  lim  V(x(t),t)  -  ». 

t-*t„ 


t_^0  t_>t0 

=  v  ( tQ,  v°  +  1 )  which 

Is  finite,  which  proves  the  second  assertion.  The  first 
assertion  follows  from  the  fact  that  v  (T;v^+l)  is  an  upper 
bound  to  V(x(T),  t)  for  any  solution  x(t)  to  (0.1)  and  the 
assumption  that  [  |  x  |  |  — ►  °o  =£>  V(x,T)  -»  »./// 

1.6  Corollary:  Let  f(x,u,t)  =  A(t)x(t)  +  u(t)  where  each 
component  of  A(t)  is  measurable  and  bounded.  Suppose  for 
all  uefi,  ||  u(t)  [|  <L  a.e.  and  that  A1  is  satisfied,  then 
S^(X°)  is  bounded  if  X°  is  compact. 
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Proof:  Let  K  =  ess  sup  | |  A(t)  | |, 

t€[0,T] 

0(V,t)  =  2(K  +  L)  V  and  V(t)  =  ||  x(t)  | |2  +  1.  Then 

3TV  =  2IIXII^IIXII  =  ^^xi)  ^ 

=  2  [x  *  (Ax  +  u) ] 

<2||x||(k||x||+||u||)  a.e . 

<  2KV  +  2LV  since  ||  x  | |?  +  1  >  ||  x  || 

=  2(k+L)V  and  apply  theorem  1.5./// 

Remark:  1.6  can  be  proved  directly.  Solutions  of  (0.1) 
for  f  =  Ax  +  u  are  of  the  form 

x(T)  =  Y(T)  x(0 )  +  Y(T)Y_1  (s)u(s)ds  where  Y(T)  is 

the  matrix  solution  of  ^  *  A(t)  Y  satisfying  Y(0)  *  I 
(B.2.5).  Since  x(O),  Y(T),  Y”^(s)  and  u(s)  are  bounded 
so  is  x(T)  for  x(0)€X,  U€fi. 

We  next  turn  to  the  question  of  whether  S,p(X)  is  closed 
or  not.  Here  we  follow  Fillippov  [7].  It  will  be  necessary 
to  consider  the  class  of  admissible  "f's",  i.e.,  G  ■  [f(x,u,t 
ucft}  as  well  as  ft  itself.  We  now  introduce  the  assumptions: 

A5.  f(x,u,t)  and  Vxf(x,u,t)  exist  and  are  continuous 
in  x,u,  and  t. 

A6.  ft  -  (u/u(t)  c  Q(x,t),  u(t)  measurable  and  bounded} 
where  Q(x, t)  is  closed  and  bounded  for  each  x  and  t  and  is 
upper  serai-continuous  as  a  function  of  x  and  t. 
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A7.  R(x,t)  =  (f(x,u,t)  /  u€ft}  is  convex  for  each 
x  and  t . 

Remarks :  By  the  continuity  of  f  in  u,  R(x,t)  is  nper  semi- 

continuous  in  x  and  t,  where  by  definition,  a  set  valued 
function  $(s)  is  upper  semi- continuous  if  for  any  s  and 
any  e  >  0  there  exists  a  6  =  6(s,e)  >  0  such  that  every 

point  of  f (s' )  is  within  a  distance  e  of  some  point  of  $(s) 

for  all  ||  s-s 1 | |  <  6 . 

1.7  Theorem:  If  Al  -  A7  are  satisfied,  S^,(X^)  is  closed. 

1  P 

Proof:  Let  x  (T),  x  (T),  ...  be  any  sequence  of  points  in 

0  12 
ST(XW),  corresponding  to  trajectories  x^t),  x  (t),  .... 

By  assumption  and  the  proof  of  Theorem  (1.5) 

II  x^(fc)  II  SUP  v*(t,v0)<*>,  for  v°  >  sup  V(0,x) . 

[0,T]  X€X° 

On  the  other  hand  |  ^  |  j  =  |  |  f(x^(t),  u^(t),  t)  |  |  M 

almost  everywhere  for  some  M.  This  follows  from  the  upper 

semi-continuity  of  R(x,t).  For  suppose  that  for  t^  -*•  t, 

^  £  that  ||  f(£^,u^,t^)  |  |  -*  »,  then  for  t^,£^  sufficiently 

close  to  t,£  R(^,t^)  is  contained  in  an  c-neighborhood 

of  R(£,t)  which  is  bounded  since  QU,t)  is  bounded  and  f 
is  continuous;  a  contradiction.  These  two  facts  imply  that 
there  exists  a  subsequence  x^(t)  which  is  equlcontinuous, 
hence  we  can  extract  a  uniformly  convergent  subsequence 
converging  to  x(t),  which  Is  absolutely  continuous  and 
such  that  ||  |  |  £  M  a.e.  These  last  two  facts  obtain 

because  x^(t),  J  =  1,2,...  each  satisfy  a  Lipshitz  condi¬ 
tion  with  the  same  constant  M. 
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let  -  y(t),  and  .  yl(t)  J.1,2,.... 

Then  y,  y*,  ...,  are  defined  almost  everywhere,  are  measur¬ 
able  and  bounded.  Let  t^  be  a  point  at  which 

exists.  We  then  show  that  ^  e  R(x(tQ),  tQ). 

Since  R(x,t)  is  u.s.c.  with  respect  to  inclusion, 
for  any  c  >  0,  there  is  a  6  >  0  such  that  R(x,t)  e  U£, 
a  closed  e  neighborhood  of  R(x(t^),  tQ)  whenever 
|  t-tQ  |  <  B  and  j  x-x(tQ)  J  <  2M6.  By  making  6  smaller 
if  necessary,  we  can  assume  that 


(1.8)  1 1  au *1 

*■*0 


for  t-t 


<  8  .  On  the  other  hand 


]t(t)  -  x(to>  .  in,  :  ,1(to > 


t-t. 


!-♦  o« 


t-t. 


I'if.  I** 


(1-9  ) 


Jo1  y1(t0  +  (t-t0)»)ds 


8o 


For  almost  all  t. 

y1^)  =  dxd4T^  =  f(xi(T)»  ul(T)>  T)  e  R(xl(T)>  T)« 

Also  for  1  large  enough  and  t  satisfying  |  tQ-t|<& 

we  have  ||  xi(tQ)  -  x(tQ)  ||  <  M6  and 

1 1  x1(t0)  -  x1(t)  ||  <  M6 ;  hence  1 1  x i(t )  -  x(tQ )  |  |  <  2M6 

implying  that  R(t,  xi(t))  cU£.  Thus  for  i  large  enough 

and  |  t-t^  |  <  6  the  integrand  of  (1.9)  is  contained  in 

limit  of  the 

Ue  for  almost  all  s  and  hence  the  /^integrals  belongs  to 

U  (U  is  convex) .  Consequently  -  ft-)..-  x^c)  €  U  and 
€  t-tQ  e 

from  (1.8)  we  have  that  £.x. \  e  U0  .  Since 

dt  2€ 

h(t^,  x(tQ)  is  closed  and  e  is  arbitrary 
dx(tn) 

— at2-  c  R(to-  x(t0n- 

Finally  the  following  lemma  due  to  Fillipov  [7]  which  we 
state  without  proof  guarantees  the  existence  of  an  admissible 
control  yielding  the  trajectory  x(t)./// 

1.10  Lemma:  Let  the  vector  function  f (x(t),u, t)*g(u, t) 

be  Jointly  continuous  in  u  and  t,  and  suppose  =  (u/u(t)eQ(t ) ) 
Is  the  set  of  admissible  controls  where  Q(t)  is  compact  and 
u.s.c.  with  respect  to  inclusion  (in  t).  Let  the  vector 
f(x(t),  u(t),  t)  describe  a  set  R(t)  when  u(t)  describes  Q(t) . 
Then  If  y(t)  is  a  measurable  vector  function  such  that 
y(t)e  R(t)  there  exists  a  measurable  vector  function  u(t)cQ(t) 
such  that  f(x(t),  u(t),  t)  =  y(t)  for  almost  all  t. 
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We  have  now  considered  systems  (0.1)  for  which  S,p(X^) 

Is  closed  and  bounded.  In  general,  however,  S,p(X0)  Is  not 

convex.  We  will  now  consider  two  ways  of  "making"  It  convex 

under  additional  assumptions.  The  following  assumptions 

lead  to  the  first  and  easiest  approach. 

A8.  Suppose  f(x,u,t)  Is  of  the  form 
n 

f±(x,u,t)  *  Y  A^(t)  x  +  ujL  1  *  l,...,n 

where  A^(t)  Is  continuous  for  each  1 J  and  suppose  fg(x,u,t) 
is  convex  in  (x,u)  for  all  x,  u,  and  t;  i.e., 
fQ( Ax1  +  (l-X)x2,  Xu1  +  (1-X)u2,t)  £ 

^f0(x1,u1,t)  +  (l-X)  f0(x2,u2,t)  for  0  £  A  £  1, 

where  u  -  (u^, ...,un) 

A9.  fi  »  (u(t)/u(t)c  Q(t),  u  measurable  and  bounded} 
where  Q(t)  is  closed,  bounded  and  convex  for  all  t  and  is 
upper  semi- continuous  as  a  function  of  t. 

1.11  Theorem:  Under  assumptions  Al,  A2,  A3,  A4,  A8,  and 
the  set 

s£(X°)  -  (x/xQ  £  xtf  xl  -  xj,...,xn  -  x* 

for  some  xfc c  Sj(X°)} 
la  convex  If  X°  is. 

Remarks r  Notice  in  A8  that  none  of  the  f^(x,u,t)  l-l,...,n 

depends  on  x^.  Sj£(X®)  Is  equivalent  to  S^(xP)  for  purposes 

of  minimization.  In  the  proof  below,  T  could  be  replaced 
by  t9  for  any  0  £  £  T  thereby  establishing  the  convexity 

of  S+(x°). 
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Proof:  (of  theorem) 

1 T  PT  4.  0 

Let  x  and  x  e  Sjp ( X  )  then  there  exist  by  the 
definition  of  u\u2,x\x2  such  that 

~  =  f  (xk(t),uk(t),t)  a.e .  0  ^  t  <  T,  k  =  l,2. 

xk(0)€X°  and  xk(T)  <  xkT,  xk(T)  =  xkT  i  =  l,...,n,  k-1,2. 
By  the  convexity  of  X°,  Xx1(0)  +  (l-X)  x2(0)€  X° 
and  by  the  convexity  of  Q(t)  uX(t)  =  Xu^(t)  +  (l-X)  u2(t) 
is  an  admissible  control.  Then  we  observe  that 

(^x^(t)  +  (1-X)x2(t))  =  X^j:  x*(t )  +  (l-X)  ^■(t) 

=  X  [Z  AiXj(t)  +  ul(fc)J  +  (i“X)[^Aixj(t)+Ui(tr] 

=  £  A^[XxJ  +(1-X)x2]  +  Xu*  +  (l-X)u2  i  -  1, 
dxQ  > 

Also  »  fQ  (x(t),u  (t),t)  is  equivalent  to 

x0(fc)  ■  x0(°)  +  Jq  fQ(x(t),uX(t),t)dt. 

By  the  convexity  of  fg  we  have  that 
(1.12)  j£  fQ(Xx1(t)  +  (l-X)x2(t),uX(t)) 

£  J0TXf0(x1(t),u1(t),t)dt  +  JqT  (l-X)fQ(x2(t),u2(t),t)dt 

Then  xX(t)  given  by 

xX(t)  »  +  (l-X)x2  i  - 

xX(t)  -  Xx*(0)  +  (l-X)x2(0)  +  J**  f0(xX,...,xX,^s)ds 

la  a  solution  of  the  differential  equation  for  the  control 
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and  initial  condition  x(0)  =  Xx*(0)  +  (l-X)x^(O).  We 
also  have  by  1.12  that  Xq  £  X  Xq(T)  +  (I-X)xq(T).  Thus  by 
construction  Xx*  ^  +  (l-X)x^  €  S^(X^)./// 

The  second  approach  to  the  convexity  problem  is  due 
to  Halkin  [10].  We  merely  outline  the  development  here, 
and  for  convenience  we  assume  that  =  (x^) .  The  appro¬ 
priate  assumptions  to  replace  A8  and  Ay  are  the  following: 

A  10.  Q  »  (u/  u(t)  cQ,  0  £  t  £  T,  u  bounded  and  measur¬ 
able]  for  some  set  QcEr. 

A  11.  For  any  trajectory  x(t)  satisfying  (0.1)  there 
exists  €  >  0  such  that  f  and  7  f  are  defined,  uniformly 
equicontinuous  with  respect  to  x,  and  uniformly  bounded 
for  all  x,u,t  €  N(x, £)x  Q*  where 

H(x,«)  -  (fot)  /  ||  x-x(t)  ||2  +  |  T-t  |2i  e2, 

0  £  t  £  T]  where  Q  is  any  bounded  subset  of  Q. 

Let  Y(t;u)  be  the  matrix  solution  of  the  adjoint 
equation 

H  -  -T  Vxf  ;  Y(T)  I, 

x  -  x(t;u) 
u  -  u(t) 

where  I  is  the  (n+l)  x  (n+l)  Identity  matrix  and  x(t;u) 
denotes  the  solution  of 

*  f(x,u,t)  a.e.  0  £  t  £  T  x(0)  -  x°. 

Then  we  define  y(t;u,v)  by 


(1.13)  y(t;u,v)  -  Y(t;v)(x(t;u)  -  x(t;v)) 
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for  0  t  T  and  an  approximation  y+(t;u,v)  to  y(t;u,v)  by 
(1.14)  y+(t;u,v)  =  Jq  Y(t, v) (f (x(t; v),u(t),t)  - 

f(x(-r;v),  v(t),t)(Jt 

Roughly  speaking,  if  at  time  t,  one  imposes  the  condi¬ 
tion  x(t)  =  x(t;u)  and  then  uses  the  control  v  for  the  re¬ 
mainder  of  the  trajectory,  i.e.,  to  time  T,  one  arrives  at 
x(T;v)  +  y(t;u,v)  at  least  to  a  linear  approximation.  In 
even  rougher  terms,  y+(t;u,v)  is  the  best  approximation  to 
y(t;u,v)  that  an  observer  traveling  along  x(t;v)  can  make 
knowing  only  the  relevant  quantities  evaluated  for  x  on 
the  trajectory  x(t;v). 

Finally,  for  fixed  control  v  we  introduce  the  two  sets 
Ht(v)  =  (y(t;u, v)  /  u  eft] 

H^(v)  =  (y+(t;u,v)  /  ueO}. 

Clearly, 

HT(v)  =  (a  -  x(T;v)  /  a  €  ST(x°) ) 
and  Ht(v)  is  convex  iff  St(x°)  is.  In  general,  of  course, 
neither  are,  but  the  principle  results  of  [10]  are: 

1.15  Theorem:  Under  assumption  A2,  A5,  A  10  and  A  U, 

H+(v)  is  convex  for  0  <  t  <  T  for  any  admissible  control. 
Proof:  [10;  p.94], 

1.16  Theorem:  Under  the  above  hypotheses,  if  y  »  0  is  a 
boundary  point  of  HT(v),  it  is  a  boundary  point  of  H^j(v). 
Proof:  [10;  p.103]. 
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In  the  next  section  we  give  a  derivation  of 
Pontryagins  Maximum  Principle  [15]  based  on  tht  system 
assumed  in  Theorem  1.11  since  that  will  be  sufficient  for 
our  purposes.  The  derivation  based  on  A  10,  A  11  and  Theorems 
1.15  and  1.16  is  very  similar. 
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Maxlrnum  Principle:  The  problem  we  consider  initially 
is  the  following: 

Minimize  Xq(T) 

(2.1)  subject  to  x(0)  -  x° 

xi  (?)  ■  1  ■  1, . .  ,,n 

If  *  f(x>u,t) 

uen 

under  the  assumptions  AS,  At,  A5,  A7,  Aft  A9  of  the  previous 
section . 

Remarks:  f(x,u,t)  is,  of  course,  of  the  very  special  form 


fl  *  Y  Ai(t)  XJ  ui  1  *  • .  .,m 

J-l 

fo  convex  in  (x,u)  but  for  ease  of  notation  we  will  write 
it  in  the  general  form.  Also  in  kj  we  must  assume  the 
convexity  of  R+(x,t)  «  (x/x,  ^  fQ,  Xl  -  ^ . ^  ^ 

r  €  R(x, t))  rather  than  R(x,t)  but  this  does  not  alter  our 
problem. 

Under  these  hypotheses,  s£(x°)  le  bounded,  closed  and 
convex .  Thus  (2.1)  is  reduced  to  a  programming  problem 
in  fundamental  form  (II. 1.1). 


(2.2) 


Max 


s.t. 


UqZ  +  P  *  o 
PeX 
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Where  X  -  (P  /  P  -  x  -  xT,  x  cS^x0)},  xT  -  (0,x^, .  •  •#xnT) . 

The  first  two  conditions  of  (2.1)  can  be  generalized 
by  allowing 

x(0)  c  X°,  (x1(T), . .  .,xn(T))  e  XT  where  X°  and  XT  are 
compact  convex  sets.  Note  X°CEm+1  while  XTcEm.  In  this 
case,  (2.2)  becomes 
(2.3)  Max  Z 

s.t.  UQZ  +  P  +  Q  -  0 

PeX',  Q  *  (0,x1, . .  .,xn)  for  some 
(xx, . .  .,xn)  cXT,  and  X'  -  (P  /  P€ST(X0)).  We  will,  however, 
confine  our  discussion  to  (2.1)  and  (2.2);  the  generalization 
to  (2.3)  is  quite  simple. 

We  assume  that  (2.2)  is  regular.  Then  in  theory  we 

could  apply  the  generalized  simplex  method  to  obtain  the 

* 

optimal  point  p  and  the  dual  variables 


7r,u.  This,  however,  does  not  give  any  Information  about 

#  # 

optimal  trajectories  x  (t)  or  optimal  controls  u  (t).  To 


help  us  do  this,  we  consider  the  following  simple  lemma: 

2.4  Lemma:  If  a  solution  %(t),  of  »  f(x.u.t),- ucft,  x(0)cX° 
belongs  to  the  boundary  of  ST(X°),  then%(t)  belongs  to  the 
boundary  of  S+.(X°)  for  0  <  t  <  T. 

Proof:  Suppose  ^(t^)  is  an  interior  point  of  Sfc  (X°).  Let 
Mt  :En+1-*En+1,  be  the  map  which  takes  points  x  x(t^) 
of  solutions  into  the  value  of  the  solution  at  time  T. 
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x(T),  with  the  control  changed  to  the  sane  control  as  ftrx.  Let  U  be  any 

neighborhood  of  x(t, )  contained  in  S.  (X°)  and  consider 

Z1 

V  =  M.  (U).  Clearly  VcSJX0).  By  A. 1.7  M._1  is  con- 

Z1  L  zl 

tinuous,  hence  V  is  a  neighborhood  of  x(T)  contained  in 
ST(X^)  which  is  a  contradiction  since  x(T)  is  a  boundary 
point  of  ST(X°). 

We  wish  now  to  find,  for  0^  t^T,  a  supporting  hyperptone 
defined  by  n(t)  to  S+(X°)  containing  x  (t)  where  x*  ix  an  optimal 
trajectory.  Clearly  we  want  to  take  n(T)  =  v  where  rr  is 
optimal  for  the  dual  of  (2.2).  We  now  seek  to  determine 
n(t)  for  other  times  t. 

At  time  T  we  have  n(T)[x*(T)  -  x]  £  0  for  all  x  esT(X°). 

*  * 

We  now  consider  x  (t)--x(t;  T,x,u  )  where  x(t;  T,x,u  ) 

denotes  the  solution  of  (0.1)  satisfying  x(T;  T,x,u  )  =  x 
where  the  control  is  u  =  u  .  Making  use  of  (B.3.1)  we  can 
represent  this  difference  by  means  of  solutions  to  the 
variational  equation,  i.e.. 


x  (t)-x(t;  T,x,u  )  =  ||x*(T)  -  x||  h(t)  +o(||x*(T)  -  x||) 
where  h(t)  satisfies 


(2.5)  =  Vf 

dt 


•  h(t) 

x  =  x  (t) 
* 

u  =  u 


where  Vf  is  to  be  evaluated  along  x  (t),u  (t). 
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and  (2.6)  h(T)  * 
optimal  there  must  be  no  vector  x*  and  time  t^  such  that 

x1  -  x*(t1)  +  h(t1)  |  |x1-x#(t1)  j  |  +  o  |  |x1-x#(t1)  |  |€St+^(x°) 

and  n(T)  h(T)  <  0  for  some  solution  of  (2.5).  For  if  this 

i  on  the  lin$  Joining  them 

were  the  case  by  taking  x  close  enough  to  x  (t1)^and  by 

1  # 

changing  the  control  which  achieves  x  to  u  on  t^  t  ^  T 

T  o 

we  would  obtain  a  point,  x  ,  in  ST(x  )  such  that 
n(xT-x  (T))  *  n(T)  (xT-x  (T))  <  0  which  would  be  a  contra¬ 
diction. 

To  help  us  find  points,  x1,  in  St(x°)  we  consider  the 
adjoint  equation  to  (2.5) 

(a.7)  -  -n(t)  7f|x  ,  ,«(t) 

* 

u  •  u 

Solutions  of  (2.7)  have  the  property  that  for  any  solution 
h(t)  of  (2.5)  we  have 

jr  [n(t)  h(t)J  -  n(t)J  h(t)  +  n(t)  i  h(t) 

-  -n(t)  vfh(t)  +  n(t)  vf  h(t) 

-  o. 

As  intimated  by  the  notation,  the  solution,  n(t),  of  (2.7) 
with  the  terminal  condition  n(T)  »  v  is  the  supporting 
hyperplane  'ant.  We  formalize  this  as  the  following 
theorem: 


W  - 


(T)  - 


In  order  for  x  (t)  to  be 


II 


# .  o  * 

2.8  Theorem:  In  order  for  x  (t)  =  x(t:0,x  ,u  )  to  be  an 


optimal  solution  of  (2.1),  there  must  exist  a  solution. 


n  (t),  of  2.7  such  that 

Min  H*(t)x  =  n*(t)  x*(t)  for  0  <  t  <  T. 
x  €  St(x°) 

Proof:  Suppose  n(t1)  [x1-x#(t]L) ]  <  0  for  x1  eSt  (x°),  then 

Xx^  +  (l-X)  x  (t^)  €  S  +  (x^)  for  0  ^  X  £  1.  Let  h(t)  be 

1  1  */t  \ 
the  solution  of  (2.7)  satisfying  h(t.)  =  — x  « ~  1'  . 

|  |  x  -  x  (t^)  |  | 

Then  n ( t ^ )  h(t1)  =  n(T)  h(T)  =  tt  h(T)  0.  Furthermore, 

tt[x(T;  t1,  x1,  u*)  -  x(T;  t1,  x#(t1),  u*)]  = 

ir(h(T)  |  |x1-x*(t1)  ||  o(  |  lx1  -  x#(t1)  |  | )}  <  0 

for  x  sufficiently  close  to  x  (t1)  along  the  line  Joining 

them.  This  Is  a  contradiction  hence  n(t)  [x  -  x  ( t ) ]  >0 

for  all  t  and  xeS+(x°)  and  hence  for  x  St(x0)./// 

Remark:  Theorem  2.8  holds  whether  (2.2)  Is  regular  or  not. 

2.9  Corollary:  The  affine  dimension  of  St(x°)  Is  non- 
decreasing  In  t. 

Proof:  Suppose  n^(T)(x  -  x  (T))  =  0  for  all  xeS^(x0) 

J  =  l,...,r.  Then  n^(t)(x  -  x*(t))  =  0  for  all  xeS+(x°) 

J  =  l,...,r  for  t  <£  T.  Where  S*(x°)  denotes  the  convex 
hull  of  St(x°).  Finally,  if  nJ(T) (x  -  x*(T) )  =  0  for  all 
xeST(x°)  but  there  exists  x1  e  s£(x°)  such  that  (T) (x1-x#(T)  /  0 
we  easily  see  that  S^(x^)  must  be  convex.  Then  we  apply  the 
first  argument  to  S,.„,  rather  than  to  S+,  S fc./// 

Finally  we  come  to  the  celebrated  maximum  principle  of  Rxtrysgin 
(here  because  of  a  choloe  of  sign  it  become  a  'kLnlmun”  principle). 

First  we  must  impose  a  further  condition  on  the  class  of 


admissible  controls. 
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Definition:  Q(t)  Is  said  to  admit  constant  variations 
at  t  if  for  every  uc  Q(t)  and  e  >  0,  there  exists 
uX,6  >  0,  such  that  |  |  u1-u  |  |  <  €  and 

u1€Q(t1)  for  all  t1  such  that  t1  c  I  where  I 
is  either  the  interval  [t,  t+6]  or  [ t- 6 ,  t]. 

Examples:  Q(t)  =  U  a  constant  set,  or  Q(t)  *  U(t), 

a  set  function  monotonic  in  t  are  examples  of  sets  admit¬ 
ting  constant  variations. 

2.10  Theorem:  Assume  that  Q(t)  admits  constant  variations 

for  all  t;  and  that  u(t),  0  ^  t  ^  T  is  an  admissible  con- 

*.  . 

trol  such  that  the  corresponding  trajectory  x  (t)  satisfies 
x*(0)  -  x° 

x^  (T)  *  x^  1  *  l,...,n 

Hr  .  *  *  . 

TF  *  f(x  'u  in  fc* 

#  * 

Then  in  order  for  u  (t)  and  x  (t)  to  be  optimal,  it  is 
necessary  that  there  exist  a  solution,  n(t),  of  (2.7) 
such  that: 

(2.11)  H(E(t),  x* ( t ) ,  u#(t))  =  Min  H(D(t),  x*(t),  u) 

ueQ(t) 

almost  everywhere  in  t  for  0  S.L1.  T,  where 
H(n(t),  x(t),  u( t ) )  =  (n(t)  f ( x ( t ) ,  u(t),t))« 

Furthermore,  n^ft)  ^  0  Is  constant. 

Proof:  We  use  the  same  n(t)  as  in  theorem  2.8. 
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Applying  Theorem  2.8  we  have: 

.  +At  #  #  ^ 

jt  [f(x(r),u,T)-f(x  (t),u  (t),t)]c1t  j  £  ° 

for  any  control  u. 

For  almost  all  t^ 

(2.12)  lim  J^1  f(x  (t),  u*(t),  r)dT 

Ab-*0  1 

=  f(x*(t1),u*(t1)>  tx)  see  [13,  p.255]. 

Suppose  (2.11)  does  not  hold  almost  everywhere.  Then  there 
is  a  point  such  that  (2.11)  does  not  hold,  but  (2.12) 
does.  Let  u1  be  such  that 

n(t1)[f(x*(t1),  u1,  t1)  -  f(x*(t1),  u*(t1),  tj\  <  0. 

Since  Q(t)  admits  constant  variations,  and  by  the  continuity 
of  f(x,u,t)  in  its  arguments  there  exists  6,5^  >  0,  and  u 
such  that 

H(t)[f(x(t),u,t)  -  f(x*(tx),u*(t),  t1)]  <  -6/2 

for  ,t1  £  t  £  t1  +  b1 

(or  t^  +  61  £  t  £  t1  in  which  case  the  modifications  will 
be  left  to  the  reader). 

Then  using  (2.12)  we  have 

-t.+At  # 

Ju  [f(x(t),u,t)  -  f(x(t),u  (t ), t) ]dt 

-  [f (x*(t1),u,t1)  -  f(x#(t1),u#(t1),t1)]At  +  o(At). 
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Multiplying  both  sides  by  n(t,)  and  letting  At  -*  0  yields 

TT 

r-t.+At  *  *  1 

lira  n(t1+At)  |t1  [f(x(t),u,r)  -  f(x  (t),u  (t), t) ]dt  [  <  0 

Ato-0  At  L  1  J 

which  is  a  contradiction. 

That  IIq ( t )  is  constant  follows  from  the  fact  that 

3fi 

S^*0  1  -  0,...,n./// 

We  close  this  section  by  indicating  how  Halkin  proves 

the  maximum  principle  in  [10].  Clearly  for  an  optimal  tra- 

#  # 

Jectory  x  (t)  with  correspcnding  control  u  (t)  we  have  that 

x  (T)  is  a  boundary  point  of  ST(x°).  This  is  equivalent 

to  y  »  0  being  a  boundary  point  to  HT(u  ) .  By  Theorem 

1.16,  y  «  0  is  a  boundary  point  to  HT(u  )  which  by  Theorem 

1.15  is  convex.  Let  v  denote  the  inward  pointing  normal  to 

+  / 

a  supporting  hyperplane  of  H,j(u  )  at  y  »  0.  By  the  defini- 
tion  of  Hp(u  )  we  obtain 

(2.13)  0  £  7T  JqT  Y(t,u*)  ( f (x*(t ) ,  u(t),t)  - 

f(x*(x),  u*(T)fx)dt 

for  all  admissible  control  functions  u.  Let  n(t)  »irY(t,u  ) 
which  satisfies  (2.7).  Pointwise  the  integrand  of  (2.13) 
mu  *•  be  non-negative  almost  everywhere  since  n  by  AD 
obviously  admits  constant  variations,  and  the  previous 
argument  goes  through.  Of  course  in  this  case  H^t)  need 


not  be  a  constant. 
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3.  Transversallty  Conditions;  We  now  return  to  the  problem 
(2.3)  and  show  how  In  a  very  easy  way  we  can  obtain  trans¬ 
versallty  conditions  for  the  initial  and  terminal  points. 

3 . 1  Theorem :  For  (2.3),  the  problem  with  variable  endpoints, 

theorem  2.10  is  valid  and  moreover  we  can  choose  Il(t)  such 

* 

that  n(T)  [x  -  x  (T)]  ^  0,  for  all  x  such  that 

x  ■  (xq*(T)<x1>  ..  .,xn)  where  (xr  . .  ,,xp)  eXT.  n(0)  [*  -  **(o)]  0, 

0  0  T 

for  all  x €  X  ,  whenever  both  X  ,  and  X  are  compact  and  convex. 
Proof :  Given  an  optimal  trajectory  it  must  solve  the  fixed 

endpoint  problem  for  x  =  x  (0),  x  =  x  (T)  hence  2.10  is 

valid.  S0(X°)  is  X°  hence  n(0)(x  -  x  (0)]  >  0,  using  lemma 

2.4.  E(T)[x  -  x*(T)  ]  =  tt[x  -  x*(T)  ]  follows  from  II. 5-5./// 

In  order  to  be  more  specific  about  how  the  simplex 

method  can  be  applied  to  optimal  theory,  we  treat,  in  the 

next  section,  the  special  case  of  linear  control  problems. 
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4.  Linear  Optimal  Control:  We  consider  here  the  following 
problem: 

(4.1)  Min  xq(T) 

s.t.  =  A(t)  x(t)  +  u(t) 

x(0)  e  X° 

(x1(T), . . .,xn(T) e  XT 

u  €  fi  =  (u:uls  measurable ,  and  u(t ) e  Q  ( t ) ) 

Where  A(t)  is  continuous  on  [0,T],  Q(t)  is  convex,  compact 

for  all  t,  0  ^  t  T,  and  as  a  set  function  is  upper  semi- 

continuous  in  t.  Further,  we  assume  that  Q(t)  admits  con- 

0  T 

stant  variations .  X  ,  and  X  are  assumed  to  be  convex  and 
compact . 

Remarks :  f(x,u,t)  *  A{t)x  +  u  is  continuously  differentiable 
with  respect  to  x  and  is  continuous  with  respect  to  t  and 
u.  By  the  results  of  the  previous  section,  St(x°)  is 
closed,  bounded  and  convex  for  each  t,  0  £  t  £  T  .  The 
conditions  for  Flllippov’s  Lemma  1.10  are  also  satisfied. 
Moreover,  we  are  aided  because  we  have  an  explicit 
representation  for  the  solutions  of  4.1  for  various  ueQ 
(B.2.5). 

(4.2)  x(t)  =  Y(t)  x(0)  +  j'^Yft)  Y_1(s)  u(s)  ds,  where 
Y  satisfies  the  matrix  equation 

(it. 3)  -  A(t)  Y(  t)  with  initial  condition 

Y(0)  =  I,  the  identity  matrix. 
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*  * 

If  x  (t),u  (t)  are  optimal  for  (4.1)  then  there  exists  a 
ir  such  that 

ir(x(T)  -  x*(t) )  >  0  for  all  x(T)  €ST(x°)  Using  (4.2) 
we  obtain 

ir[  Y(T)  Y-1(s)  (u(s)  -  u*(s))dsj^O  . 

Letting  H(s)  =  ttY(T)  Y~'*‘(s)  we  observe  (  B.3.6) 
that  n(s)  is  a  solution  of  the  adjoint  equation  for  (4.1). 
This,  of  course,  is  the  same  solution  as  obtained  in  Theorem 
2.8.  For  (4.1)  the  Hamiltonian  H  =  n(t)  f  (t)  = 

D(t)[A(t)  x(t)  +  u(t)]  and  H(x,n,u*)  -  H(x,n,u)  = 
n(t)[u(t)  -  u  (t)].  Hence  the  maximum  principle  2.10 

specialized  to  the  linear  case  becomes: 

* 

4.4  Theorem:  Suppose  that  u  (t),  0  <  t  <  T  is  an  admissible 

*  *  V 

control  such  that  the  corresponding  trajectory  x  (t)  satlsflei 
**(0)  -  *° 

x  ^ (T)  =  x^  i  s  1,  . . .  ,n 

=  ^  +  ^  a,e*  in 

#  * 

Then  in  order  for  u  and  x  to  be  optimal  it  is  necessary 
thsrG  0x1st  3.  solution  TT  ^  -  -n(t)  A(t) 

such  that  0  <  H(n(t),x*(t)  ,u)  -  H(n(t),x*(t),u*(t)) . 

*  n(t)[u  -  u*(t)]  for  all  u  e  Q(t ) 
for  almost  all  t.  Furthermore  ^(T)  £  0. 
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If  we  knew  n(T)  =  v,  we  could  determine 

n(t)  .  irY(T)  Y^(t)  and  then  try  to  determine  u*  by  means  of 
(M)  n(t)  u*(t)  .  „ln  „(t)  u.  If  (4-5)  determlnes  u. 

U  €Q(t) 

uniquely  almost  everywhere  we  could  then  solve  for  u*(t). 

Conditions  which  guarantee  this,  as  well  as  stronger  results 
for  special  cases  of  (4.1)  can  be  found  ln  tll]i  [15;  ch>3). 

In  the  next  section  we  Investigate  ln  some  detail  the  appli¬ 
cation  of  the  generalised  simplex  method  to  a  rather  general 
linear  optimal  control  problem. 
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5.  The  equivalent  enerallzed  program:  We  formulate 

(4.1)  with  varying  endpoints  as  a  generalized  program. 

Max  Z 

(5.1)  s.t.  UQZ  +  A1Pn  +  ^P°  +  vPT  -  0 

x1  -  1 

x2  -  1 

X-,  -  1 


20)  1  -  1,2,3 


P^eX^,  P°eX°,  PT€XT 


where 

X^  «  (  J0T  Y(T)  Y-1(s)  u(s)  ds  /  uen  } 
X°  =  (  Y(T)  x°  /  x°  €  X0} 


XT  =  ((0,il/xTaT].  That  (5.1) 

and  (4.1)  are  equivalent  follows  directly  from  (4.2). 

x  -  x^ex0  ©xT  is  closed,  bounded  and  convex.  If  X  is 
regular  the  generalized  simplex  method  converges  to  the 
optimal  solution.  Suppose  at  step  k  we  have  prices 
k  k  k  k 

ir  ,  M-q  ,M-t  then  the  subproblem  is 

A°  =  Min  (7TkP°  /  P°ex°)  +  HQk 

AT  =  Min  (TrkPT  /  PTeXT)  +  M^k 

Afi  =  Min  (7rkPfi  /  P^cX^}  + 

A  =  min  (A°,AT,An). 

0  T 

The  determination  of  A  and  A  involves  solving  two  convex 

0  T 

programs  and,  for  example,  if  X  ,  and  X  are  polyhedral,  then  the 
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determination  Is  simply  a  linear  program.  Now  we  investi¬ 
gate  in  more  detail  the  determination  of 

(5.2)  An-Hfik  =  Min  (irkPn  / 

=  Min  (irk  JqTY(T)  Y"1(s)  u(s)  ds) 

U€fi 

*  Min  (  JqT  u(s)  ds)  where  nk(s)  =  rr^T)  Y_1(s) 

U€fi 

One  solves  for  the  control  in  the  above,  and  then  intro¬ 
duces  the  corresponding  value  of  the  integral  J0TY(T)  V°(s)u(s&ds 
and  into  the  master  problem  if  A  ®  if1.  In  one  particular 
case,  the  calculations  involved  are  relatively  easily  carried 
out.  That  is  for  the  so  called  "bang-bang"  problem.  Here 
H  ■  (u  :  u(t)  «  B(t)  v(t),  jv^t)  |  £  1  u  measurable) 
where  B(t)  Is  a  continuous  nxr  matrix,  and  v(t)  an  r  vector. 
Since  the  optimal  control  can  always  be  taken  on  the  boundary 
of  Q(t)  j  vi(t)  |  *  1  can  be  assumed  without  loss  of  generality. 
Hence  the  name  "bang-bang".  In  this  case,  the  optimizing 
u  is  given  by  u^t)  «  sgn[n(s)  B ( s )  ] ^  where  "sgn"  is  +1 
or  -1  depending  o.  whether  its  argument  is  positive  or 
non-positive.  In  the  even  more  particular  case  where 
A(t)  -  A,  a  constant  matrix,  then 

n(s)  - 
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k 


TT 


m 

^  Fj(T-s) 
J=0 


e^{T-s) 


where  Xj,  J  =  1,  ...,m  are  the  distinct  eigenvalues  of 
-A  and  F^J  is  a  polynomial  in  (T-s)  of  order  less  than 
the  multiplicity  of  X^  [21;  App.  D].  If  -ATr  has  distinct 
eigenvalues  F  is  a  constant  matrix.  So  in  this  special 
case,  the  ’’pricing  out"  for  can  be  accomplished  more 
or  less  analytically.  In  general,  however,  the  adjoint 
matrix  equation  must  be  solved  numerically,  say  by  discre¬ 
tization,  before  the  algorithm  starts.  In  the  next  section 
we  show  how  the  discretized  problem  can  be  solved  efficiently 
using  the  variant  of  the  simplex  method  outlined  in  section 
1.3. 
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6.  A  computational  method:  We  first  give  the  formal 
details  of  the  computational  method  suggested  by  G.  B. 
Dantzig.  We  then  indicate  how  the  method  can  be  made  more 
efficient  by  use  of  techniques  in  numerical  analysis. 

The  problem  we  consider  is  the  following: 

(6.1)  Min  x0(T) 

S,t-  .  A(t)  x(t)  +  u(t) 

x(0)  e  X° 

(x1(T),...,xn(T))e  XT 
ucft  «  (u(t)  :  u(t)eU(t),  U(t)  a  closed 
bounded  convex  set,  u  measurable 
and  bounded) 

0  T 

X  ,  X  compact  and  convex. 

The  first  step  is  to  replace  (6.1)  by  a  discrete 
analog.  For  convenience,  we  will  divide  the  time  interval 
[0,T]  into  K  equal  parts,  i.e.,  tQ  -  0,  t^  »  At, 
t2  ■  2At,...,tK  ■  KAt  ■  T.  For  large  enough  K  we  can 

replace  gf(tj)  by  where  AxJ  -  x(tJ+1)  -  x(tj). 

If  we  denote  x(t.)  by  x^,  u(t  )  by  u1^,  A(t.)  by  k^  and 

«  J  J 

U(tj)  by  UJ  for  J  -  0, .,..., K,  we  can  replace  (6.1)  by 

(6.2)  Min  x0K 

s.t.  x^’1  -  A^x*1  +’uJAt  J  ■  0,  ...,K-1 
x°€  X° 

(xlK,...,XnK)€XT 
UJ€U(tJ)  J  rnC  .  ,,K, 
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where  A^  =  [I  +  A^Atj.  We  can  solve  (6.2)  In  terms  of 


by  successive  substitutions: 

K^. 

K  rTK-lTK-2  t0,„0  .  \ 


XK  .  [AK*1AK"2...A°]x0  +  ^At[AK-1AK-2...A1+1lu1 


v:here  we  make  the  convention  that  for  i  =  K  -  1 
[kK-h*-2...hM]  =  I»  the  identity  matrix.  Now  letting 

w1  =  (AK’2...Ai+1)u1  and  A*  =  .  .  .J°]  we  obtain 

the  generalized  program 


(6.3) 


Max  Z 


.t.  U0Z  +  vtpt  +  v0P°  +  £  V1  .  0 


(vT.  V  V  *  0 

PT£(P/P0  -  o,  (P1,...,Pn)eXT), 

Or  *  Oi 

Pu  €  (P  :  P  =  A  x,  xeXj 

w1  =  (w1:  w1  =  [AK-1AK“2. . .A1+1]u,  ueU1) 

We  can  then  use  the  generalized  programming  algorithm  to 
solve  this.  Moreover,  (6.3)  has  the  structure  described 
in  1.3,  hence  the  generalized  upper  bounding  techniques 
can  be  applied.  This  has  two  ramifications.  One  is  that 
the  computation  involved  in  this  approach  is  relatively 
insensitive  to  increases  in  K  which  determines  the  accuracy 
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of  the  discrete  approximation.  The  other  point  of  interest 
is  that  Theorem  1.3.2  guarantees  that  at  most  n  +  1  values 
of  w1  (and  hence  u1)  cannot  be  taken  as  extreme  points 
of  their  respective  domains  (see  Theorem  4.4). 

Remarks:  The  method  described  in  this  section  is,  practically 
speaking,  no  different  from  the  approach  described  in  the 

*  /  V 

previous  section.  A  is  simply  Y(T)  calculated  by  the 
method  of  finite  differences,  similarly 

.  .7Ti+1]  is  the  finite  difference  approximation 
to  Y(T)  Y“^(t.),  so  the  format  given  in  this  section  is 
simply  the  implementation  of  the  previous  algorithm  in  a 
form  usable  on  a  digital  computer.  Existing  linear  pro¬ 
gramming  codes  can  easily  be  modified  to  solve  problems 
of  the  form (6.3) rapidly.  Of  course,  few  numerical  analysts 
would  go  about  solving  (4.3)  by  utilizing  the  first  order 
difference  approximation  for  For  fixed  u,  this  approach 

leads  to  errors  of  the  order  of  At  [  17 ;  Sect.  2.24] . 

Other  finite  difference  methods  such  as  the  Runge-Kutta 
method  can  yield  accuracies  on  the  order  of  (At)  .  it 
is  an  easy  exercise  to  see  that  most  of  these  numerical 
techniques  lead  to  mathematical  programs  of  the  same  form 
as  (6.3),  in  particular  the  Runge-Kutta  method  yields  a 
problem  of  that  form  which  contains  K  extra  variables. 
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The  frequent  appearance  of  the  symbol  [L.P.E.]  refer¬ 
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essentially  two  ways  of  interpreting  mathematical  program¬ 
ming  problems  geometrically.  One  is  by  considering  the 
problem  in  the  space  of  the  independent  variables  (decision 
variables,  activity  lavc^,  •’cntrols),  the  other  is  to 
consider  the  problem  in  the  space  of  the  dependent  variables 
(state  variables,  activities,  outputs).  The  second  approach 
is  the  one  followed  here  and  was  actually  used  by  Dantzig 
when  he  first  proposed  the  simplex  method  [L.P.E.,  Section 
T-3].  In  this  geometry,  the  simplex  method  (at  least  for¬ 
mally)  can  easily  be  modified  to  solve  convex  programming 
problems  [L.P.E.,  Ch.  24].  Dantzig,  In  i960,  gave  a  convergence 
proof  for  this  algorithm.  What  I  have  termed  the  generalized 
simplex  method  Is  essentially  that  algorithm  in  a  slightly 
more  general  setting.  Dantzig  arrived  at  his  convex  pro¬ 
gramming  algorithm  as  a  special  case  of  the  generalized 
linear  program  [L.P.E.,  Ch.  22],  In  this  report  we  reverse 
the  process  and  show  that  generalized  programs,  as  well  as 
linear  programming,  the  decomposition  algorithm,  and  many 
problems  in  control  theory  are  special  cases  of  the  convex 
programming  problem. 
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A  well  known  variant  of  the  simplex  method,  also  due 
to  Dantzig  [L.P.E.,  Ch.  18],  is  the  upper  bounding  technique 
for  solving  linear  programs  with  bounded  independent  variables. 
This  was  generalized  in  [4,5}  to  solve  certain  industrial 
scheduling  problems.  In  April  of  1964,  Dantzig  combined 
this  generalized  upper  bounding  technique  with  the  ideas  of 
a  generalized  program  to  propose  an  efficient  method  of 
solving  linear  control  problems.  Chapter  III  of  this  report 
is  essentially  a  development  of  that  suggestion.  A  similar 
approach  can  be  found  in  Cl8,  19,  20] . 

My  contribution,  other  than  whatever  errors  there  might 
be,  is,  I  hope,  twofold.  One  is  to  give  proofs  Justifying 
certain  formal  procedures,  such  as  taking  duals,  and  apply¬ 
ing  the  simplex  method  to  the  generalized  program  and  to 
control  problems.  The  second  aim  waB  to  put  all  the  special 
methods  and  problems  of  mathematical  programming  mentioned 
here  into  a  general  framework. 
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am  also  grateful  to  the  Operations  Research  Center  at  the 
University  of  California  and  the  RAND  Corporation  for  their 
assistance  and  for  providing  an  atmosphere  of  intellectual 
stimulation  these  past  few  years.  Finally  I  would  like  to 
compliment  Miss  Susan  Arbuckle  for  her  courage  and  skill  in 
the  typing  of  this  report  in  face  of  some  very  formidable 
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Appendix  A:  Affin^  Geometry  and  the  Theory  of  Convex  Sets 
1 .  Affine  Geometry 

We  consider  the  group  of  affine  transformations  on  the 
.jn 

vector  space  a  . 

1.1  Def In  1 1 Ions :  y  =  A(x)  =  Tx  +  t  is  an  affine  trans¬ 
formation  from  E™  into  itself  when  T  is  a  mxm  non-singular 
matrix  and  t  any  m- vector.  If  V  c  Em  is  given  by 

V  =  (y  /  y  =  A(x),  xeS]  where  S  is  an  affine  transformation 
and  S  is  a  linear  subspace  of  Em  of  dimension  k  then  V  is 
said  to  be  a  linear  variety  of  dimension  k. 

1.2  Remark:  Clearly  affine  transformations  carry  linear 
varieties  into  linear  varieties  of  the  same  dimension. 

For  if  V  is  a.  linear  variety  and  V  =  [y  *  Tx  +  t  /  xeS; 
and  W  is  given  by  W  =  (z  =  Ey  +  r/yeV]  then 

W  =  [z  =  RTx  +  (Rt  +  r)/  xeS)  which  is  a  linear  variety 
since  the  product  of  two  non-singular  matrices  is  non¬ 
singular  . 

1.3  Definition :  The  p  +  1  points  x  ,...,xp  are  affinely 
Independent  if  x1  -  x°,...,xp-  x^  are  linearly  independent 

as  vectors.  We  note  that  affine  independence  does  not  depend 
on  the  choice  of  x®. 

Corresponding  to  the  situation  for  linear  spaces  we 
wish  to  consider  linear  combinations  of  affinely  independent 
points.  But  we  wish  to  preserve  the  property  that 
(1.4)  A(K0X°  +  ...  +  yP)  =  A0A(x°)  +  ...  +  y(xp) 
for  all  affine  transformations  A.  This  is  not  the  case 
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for  arbitrary  affine  transformations  and  linear  combinations, 
To  see  this,  let  y  =  XqX°  +  ...  +  XpX*3  and  A(x)  =  Tx  +  t 

then  A(y)  =  T(xqx°  +  ...  +  Xp^)  +  t 

=  XqTx°  +  ...  +  ^Tx^  +  t 

*  ZxjtxJ  +  Zv  -  Iv  +  t 

-  Z^(TxJ+t)+t(1-^XJ) 

=  ^  ^  (1  -  Xj )  * 

Hence  the  property  (1.4)  holds  only  if  t  =  0  or  ^  Xj  =  1. 

We  Impose  the  latter  condition. 


1.5  Definition:  y 


x1, . . .,xP  if 


■  IvJ 


is  called  a  centroid  of 


1.6  Theorem:  The  set  V  of  all  centroids  of  points  [xa], 
aeX  is  a  linear  variety  containing  (xa)  and  every  linear 
variety  containing  (xa)  contains  V.  The  linear  variety  V 
is  said  to  be  the  linear  variety  "spanned"  by  (xa). 
Proof:  Let  S  be  the  vector  space  spanned  by  (xa-  x**0} 
for  a €-4,  and  fixed  aQe^,  and  let  V  be  the  linear  variety 
defined  by  V  =  [z/  z  *  Imv'z-  x  °)+  x  0  (z-  x  °)€S}  where 
Im  is  the  mxm  identity  matrix.  Assume  now  that 

Z  ”  Z  V*’  Z  *  1*  Then 

z  -  x  0  ■  Z  *Jx  °  *  Z  "  x  °)»  and  thus 

«0 

Z  -  x  tS  for  all  ZeV,  i.e.,  V  V.  Conversely,  let 
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and 


(z  -  x  °)  =  Y  Xj(x^  "  x°)>  then 

z  =  [  1  -  Xj  ]x  0  +  *jx^  *  Letting  'Xq  =  1  -  Xj 

Xj  =  Xj,  J  /0,  we  have  ^  X ^  =  1.  Hence  VeV  and  therefore  V=V  a 

linear  variety.  If  W  is  a  linear  variety  containing  (xa) 
then  W  =  (y/y  =  A(x)  =  Tx  +  t,  xeS'}  for  some  linear 
subspace  S',  and  affine  transformation  A.  But  the  trans¬ 
formation  =  T-1x-t  is  affine  and  maps  W  into  S'. 

Suppose 

y  =  ^  XjxY  ^  Aj  =  1  is  a  centroid  of  points  in  [xa] 

Then  A_1y  =  ^AjA-1xJ  =  ^XjyJ.  But  A^x^eS  hence 

A-1yeS'.  But  A(A-1y)  =  yeW  finishing  the  proof./// 

1.7  Theorem:  x°,...,xP  are  affinely  Independent  iff 

every  point  y  in  the  linear  variety  spanned  by 
0  P 

x  ,  ...,x  has  a  unique  representation  as  a  centroid  of 

0  p 

x  ,  . . . ,xF. 


Proof:  Suppose  x°, ...,xp  are  affinely  independent  and 


y  -  £«/  .  I°J  “  ISJ 

Z<°j  -  ej> xJ  -  °  an<i  Z  <aj  - 


1.  Then 

0  and  hence 


)  (a.  -  0  * )  (x^  -  x°)  =0  follows. 
J*0 


Uniqueness  means  a 


J 


must  hold  for  J  =  0,1,  ...,p.  Suppose  on  the  contrary,  for 

some  jn  that  a.  *  B  .  Without  loss  of  generality,  suppose 
u  J0  J0 


Jq  *  0,  then 
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X  (aj  "  2j)(x^  -  x^)  Implies  that 

(x1^  -  x°)  are  linearly  dependent  contradicting  the  assump¬ 
tion  that  x°, ...,xp  are  affinely  Independent. 

Conversely,  If  ^  aj(x^  -  x^)  ■  0  and  not  all  a^  *  0, 
then  (Vaj)x°  =  ^aJX*^  whicl1  is  not  unique  since 

J-l 

x°  *  (l  -  ^Oj)x°  +  Ja/.  On  the  other  hand 


J-l 


c°  ,  f  “j 

Liz 


x”  *  )  —  xw  If 

j»l  ^“k 


I 


aj  A  0.  If 


Iaj  -° 


then 


Ia/J  *  XvJ  -  *  °- 


Pick  any  non-zero 

Vk-  -ZajxJ 


aj,  say  «k 
where  » 


,  then 

*Iv 


1*  k 


J*k 


We  then  obtain  the  same  sort  of  contradiction  as  before./// 


1.8  Theorem:  x°, ...,xp  are  affinely  independent  iff  the 

matrix  X  * 

0  p " 

.  .  .  xj 

x°  xp 

^2  •  •  •  a 2 

•  • 

•  • 

■  •  • 

T0  rP 

•  •  •  x- 

m  m 

1  ...  1 

has  rank  p  +1. 

Proof:  Subtract  the  first  column  from  the  other  columns. 
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This  does  not  change  the  rank  of  X. 


1  0  .  .  .  0 


x°,...,xp  are  affinely  independent  iff  the  last  p  columns 
of  X1  are  linearly  Independent  and  clearly  the  first  column 
is  independent  of  the  rest./// 

For  example,  two  points  are  affinely  dependent  iff 
they  are  the  same  point,  three  points  are  affinely  dependent 
iff  they  are  collnt^r,  and  four  points  are  affinely  dependent 
iff  they  are  coplanar. 

1.9  Definition:  Let  A  be  any  subset  of  Em.  Its  linear 
dimension  l. d.(A)  is  the  dimension  of  the  smallest  subspace 

containing  A  and  its  affine  dimension,  a.d.  (A)  ,  is  the 
dimension  of  the  smallest  linear  variety,  VA,  containing  A. 

1.10  Lemma;  a.d. (A)  is  equal  to  the  maximum  number  of 
affinely  independert  points  of  A  less  one. 

Proof:  VA  is  simply  the  linear  variety  spanned  by  A  (A. 6). 
Suppose  the  maximum  number  of  affinely  independent  points 
Is  p  +  1  and  let  x®,  ...,xp  be  affinely  independent.  Consider 
the  matrix  defined  in  1.8. 


Ill 


"’he  last  p  columns  are  linearly  independent  and  hence 
y1  *  -  x°  fbr  J  *  l,...,p  span  a  p-dlmenslonal  vector 

space,  and  the  affine  transformation  y  =  x  +  x°  maps  the 
linear  subspace  onto  V^.  So  has  affine  dimension  at 
least  equal  to  p.  Suppose  there  exists  zeVA  such  that 
z  -  x°  is  independent  of  the  (y^)^  then 


has  rank  p  +  2  yielding  a  contradiction./// 
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2.  Convex  Sets 

1  2 

2.1  Definition :  A  set,  c,  is  said  to  be  convex  if  x  ,x  ec 

1  2 

implies  Ax  +  (l-  A)x  ec  for  all  0  <  X  <  1. 

2.2  Lemma:  If  C  Is  convex  and  x^,...,xP€C  then 

xjxi.g.c  if  >,  -i -  X j 

Proof :  By  induction  on  p.  For  p  <  2  obvious.  Suppose 

true  for  p  <  P.  Then 

e  C  and 


Pf^  A,xJ 


J=1  £  v, 
j=i  J 


p 

*  T  yJc  c  •/// 

j=i 

The  relative  1 iterlor  of  a  subset  A  of  Em  is  the  in¬ 
terior  of  A  relative  to  the  linear  variety  of  lowest 
dimension  containing  A. 

2.3  Theorem:  A  non-empty  convex  set,  C,  has  relative 
interior  points. 

Proof:  Let  k  =  a.d.(c)  and  suppose  x°,  ...,xkec  and  are 
affinely  independent.  Then  the  "simplex" 

S  =  [x  /  x  =  \  A^x  ,  V  A,  -  1,  A,  *  0}  contains  an  open 
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set  in  the  relative  topology  and  clearly  Sec.  To  see  that 
S  has  an  interior,  let  y  *  xl*  Por  any  sequence  of 
points  in  V,  y^  -*  y  such  that 

yl-  Zxix1'  hi-1  } 


1  2 

J.,  c,  .  •  .  , 


i 

we  have  by  continuity  that  for  some  J,  J  £  J  implies 
A^  >0  i  *  0, . . . ,k./// 

2.4  Definitions:  C  c  E"1  is  a  cone  with  vertex  at  the 
origin  if  OeC  and  if  xeC  then  AxeC  for  all  A  ^  0.  Any 
affine  transformation  of  a  cone-with-vertex-at-the-origin 
is  simply  called  a  cone .  A  ray,  R(x,y),  is  a  cone  of  the 
form  Ft(x,y)  =  [(l-A)x  +  Ay/ A  ^  0)  for  some  fixed  x  and  y, 
x  y  y. 


H  *  (x/ax  =  b,  a  /  0,  xeE1")  is  called  a  hyperplane 
in  E1"  and  S  =  (x/  ax  ^  b,  a  /  0,  xcE111}  is  called  a  closed 
half-space  or  for  our  purposes  simply  a  half-space.  If 
for  any  set  KcEm  we  have  ax  >  b  for  all  xeK  we  say  that 
H(S)  is  a  bounding  hyperplane  (half-space)  of  K.  If  more¬ 
over  there  exists  in  K  a  point  x  such  that  ax  ■  b,  then 
H(S)  Is  said  to  be  a  supporting  hyperplane  (half-space). 

2.5  Lemma:  Half-spaces  and  hyperplanes  are  convex. 

Proof:  Obvious. 

2.6  Lemma :  The  intersection  of  arbitrarily  many  convex 
sets  is  convex. 


Proof:  Obvious. 
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2.7  Definition :  The  Intersection  of  a  finite  number  of 
half-spaces,  K,  is  called  a  convex  polyhedral  set. 

By  2.6,  a  convex  polyhedral  set  is  convex. 

2.8  Definitions :  The  Intersection,  H(S),  of  all  convex 

sets  containing  a  set  S  is  called  the  convex  hull  of  H(S). 
n 

If  y  =  X  y  =  1  then  y  is  said  to  be 

J=1 

a  non-negative  centroid  of  the  x^  j  =  l,...,n. 

2.9  Lemma:  H(S),  the  convex  hull  of  a  set  S,  is  equal 
to  S',  the  set  of  all  non-negative  centroids  of  points 
In  S. 

Proof:  Clearly  S  c  S'  since  x  =  1  •  x  eS  for 

xeS.  On  the  other  hand.  S'  c  H(S)  since 

y  Aj  >  0,  y  Aj  =  1  belongs  to  every  convex 

set  containing  S  by  an  argument  similar  to  the  one  in  1.6./// 
We  give  now  a  few  facts  from  the  theory  of  convex 
polyhedral  sets  without  proof  since  that  would  take  us  too 
far  afield. 

2.10  Theorem:  A  bounded  set  S  is  a  convex  polyhedral  set 
iff  it  is  the  convex  hull  of  a  finite  number  of  extreme 
points . 

Proof:  [8  ]  • 

2.11  Definition:  A  point  x°  belonging  to  a  convex  set, 

C,  is  an  extreme  point  of  C  if 
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x°  =  AqX0  +  ^  Aj  £  0,  x^eC,  /  x°  £  Xj  •  1  implies 

that  Xq  =  1,  Aj  =  0,  J  /  0. 

2.12  Theorem:  Let  C  c:  e”1  be  a  closed  convex  cone  with 
vertex  at  the  origin.  If  d^  is  the  dimension  of  the  largest 
subspace  contained  in  C,  and  d^  is  the  dimension  of  the 
smallest  subspace  containing  H  =  [tt  /  ttx  ^  0  for  xeC 

then  +  d2  =  m. 

Proof:  [  6  :  p.12] 

2.13  Theorem:  If  P  is  a  convex  polyhedral  set  and  x°  is 
an  extreme  point  of  P,  then  the  cone  C(x°)  «  [tt  /  there 
exists  "a"  such  that  ttx  ^  a  determines  a  support  of  P  and 
ttx0  =  aj  has  a  non-empty  interior. 

Proof:  Consider  Pf  =  (y-  x0/  yep) .  Let  C*  •  (y/y  «  Ax, 

A  >  0,  xeP  }  irx  ^  0  for  all  xeP  iff  ttx  £  0  for  all  xeC 1 . 
Clearly  0  is  an  extreme  point  of  P  therefore  in  theorem 
2.12  we  note  that  d^C1)  =  0.  Hence  d2(C)  =  m.  Theorem 
2.3  completes  the  proof./// 

Now  returning  to  the  question  of  general  convex  sets  we 
prove : 

2.14  Theorem:  Let  K  be  a  closed,  bounded,  convex  set  and 
let  C  =  (y  /  y  =  Ax,  A  >  0,  xcKj.  C  is  closed. 

Proof:  Suppose  AjX*^  converges  to  y  where  Aj  >  0,  x^eK, 

J  »  1,2, ....  We  must  show  that  yeC.  K  is  compact  hence 
(x^J  has  a  limit  point  x.  Consider  a  subsequence  x^1- 
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1  =  1,2,...  which  converges  to  x,  and  the  sequence  X,  x. 

1 

Since  x°  are  bounded  X.  are  bounded  and  X.  has  a  limit 

J  J 

call  It  X.  Clearly  X  >  0  and  y  =  Xx./// 

2.15  Definition:  Suppose  K  Is  a  closed  convex  set.  Let 
x  be  any  point  of  K  and  let  C  =  (y  -  x  /  R(x,y)eK)  we  call 
C  the  characteristic  cone  of  K.  R(x,y)  Is  the  ray  with 
origin  x  going  through  y.  The  definition  is  Justified 

by  the  following  theorem. 

2 . 16  Theorem :  The  set  C(x)  =  (y  -  x  /  R(x,y)eK)  is  in¬ 
dependent  of  x,  xeK,  K  closed  and  convex. 

Proof:  First  we  show  that  R(x,y)cK  iff  there  exist  xeK, 
y^cK  such  that  | |  |  j  -*■  “  and  limit 


yJ  -  x 


1 1  yJ- x  || 
we  obtain  the  only  if.  Suppose  lim 


,  r  .  Letting  yJ  =  x  +  J(y-  x) 

y  -  x 


yu  -  x 


y  -  x 


y  -  x 
Z  =  x  +  X 


where  y^eK,  j  I  y  ^ 


1 1 yJ -  x  1 1 
oo.  Let  ZeR(x,y),  then 


h  -  *) 

y  -  x 


for  some  X 


o  y 


For  X 


y°  -  x 


we  have  that  Z,(X)=x+  eK.  But  Z  -  Z.(X  ) 

«  I  [y vl  I  J  U 


v  f  y  -  *  -  -3L-.--JL 

0  N  y  -  x  1 1  I |yJ  -  x| I 


y  .  Hence  Z j ( X^ )  approaches  Z 


and  since  K  is  closed  ZeK  and  since  Zn  was  arbitrarily  chosen, 

1  u  1 


R(x,y)cK.  But  the  limit 
is  Independent  of  x./// 


y"  -  X 
yJ-  x 


=  lim 


-y‘ 


J 


is 


2.17  Lemma:  Let  K  be  a  closed  convex  set  and  y^K. 


117 


Then  there  exists  tt  such  that 
lnf((ir,x)  /  xeK)  <  Try. 

Proof:  The  euclidean  norm  |  |  y  -  x  |  |  is  a  continuous  func¬ 
tion  of  x  hence  it  attains  its  minimum  in  K,  at  say  x°. 
Clearly  x°  is  a  boundary  point  of  K.  Let  tt  =  x°  -  y  . 

Try  =  -(x°  -  y)(x°-  y)  +  tt  x^  <  tt  x°  since  y  ft  x°.  On 

the  other  hand  if  xeK,  then  x°  +  X(x  -  x^)eK  for 

0  <  X  <  1.  By  the  definition  of  x®  we  have 

I  |  y  -  x°  I  I  <  I  I  (y  -  x°)  -  X(x  -  x°)  I  I  for  0  <  X  <  1. 

Squaring  we  obtain  after  cancellations  and  dropping  the 

common  factor  X  >  0,  0  <  2(y  -  x°)(x  -  x°)  -  X(x  -  xQ)2 

letting  X  —  0  we  have  (x°  -  y )  (x  -  x°)  <_  0  hence 
t  x  ^  ttx°  <  iry./// 

2.l8  Theorem:  Let  K  be  a  convex  set  and  y  a  boundary  point 
then  there  exists  tt  such  that  tt  y  =  inf  (  tt  x  /  xeK),  l.e., 
through  every  boundary  point  of  K  there  passes  a  supporting 
hyperplane . 

Proof :  Let  y^  approach  y  such  that  y*^  £  K.  Then  there 
exists  tt^  satisfying  Lemma  2.17.  We  can  assume  without 
loss  of  generality  that  |  |  ir^  |  |  »  1  hence  there  exists 
tt  which  is  an  accumulation  point  of  the  tt^.  Since 
^(X-  y1^)  >  0  for  all  J  and  xeK  we  have  tt(x  -  y)  ;>  0./// 
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B.  Ordinary  Differential  Equations: 

0.  Introduction:  Contained  in  this  appendix  are  classical 
results  from  the  theory  of  ordinary  differential  equations. 
References  will  be  given  where  the  proofs  of  these  results 
can  be  found. 

1.  General  Theory:  When  confronted  with  a  differential 

equation  ^  =  f(x,t)  one  often  considers  the  related 

integral  equation  x(t)  =  x(o)  +  JJ'  f (x,t)dt.  The  first 

dx 

equation  implies  that  ^  exists  at  every  point  of  Interest 

while  the  Lebesgue  theory  of  Integration  merely  guarantees 

that  solutions  of  the  integral  equation  are  absolutely 

continuous.  To  make  the  Identification  more  complete,  it 

d  x 

is  convenient  to  Interpret  ^  =  f(x,t)  as  holding  only 
almost  everywhere  (which  we  abbreviate  a.e.).  In  particular 
we  will  study  the  following  problem: 

Find  an  absolutely  continuous  function  x(t)  defined 
on  I  =  [t^,  t ^ ]  such  that 
(x(t),t)  erf  c.  En  x  T 

y 

(1.1)  =  f(x(t),t)  a.e.  on  I, 

and  x(t)  =  S  for  some  fixed  Tel. 

1.2  Remark:  Since  x(t)  Is  absolutely  continuous.  It  is 
differentiable  a.e.  [  9  ;  pp. 203-205]. 

1.3  Theorem:  Let  f  be  defined  on 
R[a,b,t1,t2]  =  (Xl,...,xn,t  /  a1  <  X;L  <  bL,  1  - 
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t1  *  t  <  tg>  wherjs  <  1  *  1  >  . ...  n  and  <  tg  and 

suppose  It  is  measurable  In  t  for  each  fixed  x  and  continuous 
In  x  for  each  fixed  t.  If  there  exists  a  Lebesgue-lntegrable 
function  m(t)  on  the  Interval  $  t  *  tg  such  that 
l|f(x,t)M  *  m(t)  for  (x,t)eR[a,b, t-,  ,tg]  then  there  exists 
a  solution,  x(t),  to  (1.1)  on  some  Interval 

tj_£_t j_JL_t  £  tg  z  to  satisfying  x(t)  -  i  ,  for  a^.  i  i  . 

1.4  Theorem :  In  an  open  connected  domain  2)  c  En  x  T,  let 
f  be  defined,  measurable  In  t  for  fixed  x  and  continuous 

In  x  for  fixed  t.  Let  m  be  an  lntegrable  function  such  that 
1 1  f  (x, t )  1 1  <  m(t)  for  (x,t)  e  2)  .  Then  given  a  solution, 
x(t),  to  (1.1)  on  a  non-empty  Interval  tj.  <  t  <  tg.  It  can 
be  extended  to  the  boundary  of  2>  . 

1.5  Definitions:  A  function  $(x)  Is  said  to  satisfy  a 
Llpshltz  condition  with  constant  K  on  the  domain  D  If 

||  $(x^ )  -  4>  ( x2 )  |  |  <  K||x1  -  x2  ||  for  all  x-^Xg  e  D. 

4>(x)  Is  said  to  satisfy  a  Llpshltz  condition  locally  if,  on 
every  neighborhood  in  D,  $(x)  satisfies  a  Llpshltz  condi¬ 
tion  for  some  constant  K  which  may  depend  on  the  neighborhood. 

1.6  Theorem :  If,  In  addition  to  the  hypotheses  of  1.3, 
f  also  satisfies  a  Llpshltz  condition  in  x  locally,  then 
the  solution  Is  unique. 

1 .7  Theorem:  Suppose  f  -  f(x,t,u)  Is  a  function  of 
xeEn,  teT,  and  ueE10  on  a  domain  Z>^  -  £>x  1^  where  I  - 

{yi/  |u-ng  |  <  C  },  C  >  0.  Suppose  f  Is  measurable  in  t 

t 

C 

1 
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for  fixed  m-  and  x;  continuous  In  x  for  fixed  t  and  p.  and 
for  fixed  t,  f  is  continuous  In  (x,p.)  at  4  =  ^ .  Moreover 
we  assume  there  exists  a  Lebesgae  lntegrable  function  m(t), 
for  <  t  <  tp,  such  that  ||  f(x,t,n.)  ||  <  m(t),  uniformly 
In  x, p.  for  ,  Then  If  for  m-  =  dQ 

g£  =  f(x,t,40),  x (t )  =  £ 

has  a  unique  solution  on  where  xeft-^tp]  there 

exists  6  >  0  such  that  for  any  fixed  (o,t),m.)  satisfying 
lCT-'rl  +  lln-4||  +  ||ti-ix0||<6, 
all  solutions  x(t,a,T),M-)  of 
gg  =  f(x,t,p.),  x(a)  =•-  t] 

exist  on  [t^t^ _ Moreover  as  (o,t),|J.)  -♦  (t,£,h,0) 

x(t,c,  —  x(t,Tf  ^,tJLn)  uniformly  on  [t1,t0]  . 

Proofs:  Proofs  of  theorems  1.3,  1.4,  1.6,  and  1.7  can  be 
found  in  [  1  ] . 


* 
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2,  Linear  Differential  Equations;  In  this  section  we 
apply  the  theorems  of  the  previous  section  to  linear  dif¬ 
ferential  equations  and,  in  addition,  obtain  some  special 
results . 

First  we  consider  the  homogeneous  linear  differential 
equation: 

(2.1)  — =  A ( t )  y(t)  ,  where  A(t)  is  a  n  x  n  matrix 

of  functions  of  t.  If  | |  A (t )  ||  <  M  on  some  time  inter¬ 
val  [t,,tg],  then  A(t)  y(t)  satisfies  a  Lipshltz  condition 
on  I  =  [t^tg]  with  constant  M.  Applying  theorems  1.3, 

1.4  and  1.6  we  obtain: 

2.2  Theorem:  If  each  element  of  A  is  measurable  on 
I  =  [tjjtg]  and  |  |  A (t )  | |  <  m(t)  on  I  where  ra(t)  Is 
Lebesgue  lntegrable  then  for  any  £eEn  there  exists  a 
unique  solution,  y(t),  to  (2.1)  (l.e.,  (2.1)  holds  almost 
everywhere)  with  y(t)  =  §  on  the  interval  I. 

Obviously  y(t)  =  0  is  a  solution  of  (2.1),  so  by 
uniqueness  If  y(t)  »  0  for  any  tel,  y(t)  =  0  on  I.  Let 
y^(t)  be  the  solutions  of  (2.1)  with  Initial  conditions 
y^(f)  «  e^  where  e\...,en  is  any  basis  for  E11.  Let  y(t) 
be  any  solution  of  (2.1).  Suppose  y(t)  =  £.  By  the  def¬ 
inition  of  e\...,en  there  exists  ^^,...,^n  such  that 

Ajy^(t)  we  have 

Z(t)  *  o,  hence  since  Z(t)  is  obviously  a  solution  of 

(2.1) ,  Z(t)  =  0  on  I.  This  essentially  proves: 


I 


V 


If  we  let  S 


y(t)  -  l 
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2.3  Theorem:  The  solutions  of  (2.1)  In  I  form  a  vector 
space  (of  functions)  of  dimension  precisely  n. 

Now  we  turn  to  the  non-homogeneous  problem. 

(2>20  =  x(t)  +  ^(t)  a.e., 

where  A(t)  Is  a  nxn  matrix  of  functions  and  b(t)  is  a 
n- vector  of  functions.  In  this  case  we  can  exhibit  a  solu¬ 
tion  to  (2.4)  if  we  know  a  basis  of  solutions  to  the 
"reduced  system"  (2.1). 

2.5  Theorem:  If  A(t)  and  b(t)  are  lntegrable  functions 
of  t  on  I  and  |  |  A(t)  | |  <  m(t),  |  |  b(t)  | |  <  m(t),  where 
m(t)  is  Lebesgue  lntegrable  on  I  then  for  any  point  £eEn 
and  T€l  (2.4)  has  a  unique  solution  such  that  x(t)  =  g 
of  the  form 

(2.6)  x(t)  =  Y(t)5  +  Y(t)  Y- 1  ( s )  b ( 8 )  da 
where  Y(t)  is  the  matrix  solution  of 

-  A(t)  Y(t ) 

Y(t)  =  I  the  n  xn  identity  matrix. 

The  proofs  of  theorems  in  this  section  can  be  found  in 
[  2  ;  Ch.  3]. 
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3.  The  variational  equations  associated  with  a  system  of 
ordinary  differential  equations  and  the  corresponding  adjoint 
system:  The  principal  theorem  we  need  here  is  the  following: 
3.1  Theorem:  Let  f(x,t,ii)  satisfy:  (a)  For  every  fixed 
value  p.  In  some  neighborhood  of  the  point  the  com¬ 
ponents  of  f(x,t,p.)  are  measurable  as  functions  of  t  for 
fixed  x,  continuous  In  x  for  fixed  t  satisfying 
t1  <  t  <  t2  and 

||  f(x,t,*i)  ||  M(t, |jl)  for  tj.  <  t  <  tg,  iicU^  where 

M(t,p.)  Is  integrable  In  t  for  fixed  p.. 

(b)  Along  a  solution 

x°(t)  -  x(t,nQ),  x°(tQ)  *  x(t0,nQ)  -  *0(n0) 
f(x,t,ii)  Is  differentiable  In  (x,|j.). 

(c)  If  we  let 


r  •  V M2  +  |h'  I2 

then  whenever  +  h  lies  In  we  have 

f (x  +  h * ,  t,  Hq  +  h)  -  f(x,t,Hp) 
r 


£  K(t) 


where  K(t)  Is  a  summable  function  on  <  t  <  tg* 

Then  If  the  solutions  of 
(3-2)  t,  n) 

x(tQ;n)  -  x°(n) 

are  differentiable  In  4  at  the  point  t  -  t^,  m-  *  they 
are  differentiable  In  4  for  arbitrary  values  of  t  in  the 


region  t^  <  t  <  1 2>  p.  =  p.  ,  and  their  derivatives  with 
respect  to  4  satisfy  the  "variational  equations”: 

(3.3)  §^  =  'i7xf,*y+'|“2  where  y^f  is  the  matrix  of  partial 

if 

derivatives  of  f  =  (xt~)  and  and  y-  are  evaluated 

J  ^ 

along  the  solution  x(t,p.0). 

(3.3)  is  called  the  variational  equation  associated 
with  (3.2).  An  important  special  case  arises  when  only 
the  initial  point  x®(p)  depends  on  p  yielding  the  homo¬ 
geneous  variational  equation 

(3.4)  U  =  v  f  I  •  y 

^  x  l*(t)-*(t,n0) 

Since  we  will  have  no  occasion  to  use  (3.3)  we  simply  call 

(3.4)  the  variational  equation  associated  with  (3.2). 

Solutions  of  (3. 4)  can  be  Interpreted  as  the  incremental 
change  to  the  trajectory  x(t,p0)  which  results  from  chang¬ 
ing  the  initial  point  from  x°(p0)  to  x^((jl).  Formally, 

(3.3)  and  (3.4)  can  be  derived  by  using  the  uniformly  con¬ 
tinuous  dependence  of  the  solution  to  differential  equations 


on  the  initial  values  and  f  to  make  a  first  order  approxi¬ 
mation  to  the  incremental  change  in  the  trajectory  (Theorem 
1.7).  Then  if  f  is  expanded  in  a  Taylor  series  with  respect 
to  x  and  p,  the  desired  equations  can  be  obtained  (although 
stronger  assumptions  are  needed  on  f  for  this  approach). 

The  extension  of  Theorem  3.1  to  the  case  where  p  is 
a  vector  can  also  be  carried  out  in  which  case  (3.3)  becomes 
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<3-5>  If  -  V  y+ V- 

Now  let  us  return  to  variations  only  in  the  initial 
point.  We  take  pcE11  and 

x°(u)  -  (x°  +  +  un  )  and  «  0.  Then  for  each 

t,  t1  <  t  <  t2,  y(t;u)  defines  through  (3.4)  a  linear 
homogeneous  transformation  of  the  n  dimensional  u-space 
(or  equivalently  the  y  space  at  time  t^)  onto  the 
n-diraensional  y-space  at  time  t.  Let  a  hyperplane  in 
the  \i  space  passing  through  the  origin  be  determined  by 
ir(tQ)u  «  0  or  equivalently  ir(tQ)  y  (tQ,M.)  -  0 
This  hyperplane  is  transformed  at  time  t  to  a  new  hyper¬ 
plane  determined  by  ir (t )  y  (t;p.)  »  0.  To  find  ir(t)  it 
suffices  to  find  ir(t)  such  that  tr(t )  y  (t,p.)  is  constant 
in  time  for  each  fixed  p..  For  this  to  occur,  the  following 
must  be  satisfied: 

(3.5)  C  -  ^  ir(t) y  (t,u)  -  fr(t)  y  (t,p)  +  r(t)  y  (t,u) 

-  *(t)y  (t,ji)  +  Tr(t)(7xf)  y(t,u) 

The  above  equation  can  be  inverted  (Theorem  2.3)  by  choos¬ 
ing  J  -  1,  ...,n  linearly  Independent.  Thus  if 

(3.6)  -  -ir(t)  Vxf  for  7xf  evaluated  along  the  solu¬ 
tion  x(t,0)  is  satisfied  then  so  is  (3.5).  (3*6)  is  called 
the  adjoint  equation  for  (3.4). 

The  proofs  required  for  this  section  can  be  found  in 
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